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ABSTRACT 

We study the interface between galaxies and the intergalactic medium by measuring the absorption by neutral 
hydrogen in the vicinity of star-forming galaxies at z ss 2.4. Our sample consists of 679 rest-frame-UV selected 
galaxies with spectroscopic redshifts that have impact parameters < 2 (proper) Mpc to the line of sight of one 
of 15 bright, background QSOs and that fall within the redshift range of its Lya forest. We present the first 2-D 
maps of the absorption around galaxies, plotting the median Lya pixel optical depth as a function of transverse 
and line of sight separation from galaxies. The Lya optical depths are measured using an automatic algorithm 
that takes advantage of all available Lyman series lines. The median optical depth, and hence the median 
density of atomic hydrogen, drops by more than an order of magnitude around 100 kpc, which is similar to 
the virial radius of the halos thought to host the galaxies. The median remains enhanced, at the > 3<r level, 
out to at least 2.8 Mpc (i.e. > 9 comoving Mpc), but the scatter at a given distance is large compared with the 
median excess optical depth, suggesting that the gas is clumpy. Within 100 (200) kpc, and over ±165 km s -1 , 
the covering fraction of gas with Lya optical depth greater than unity is 100l~3 2 % (66 ± 16%). Absorbers 
with TL yct > 0.1 are typically closer to galaxies than random. The mean galaxy overdensity around absorbers 
increases with the optical depth and also as the length scale over which the galaxy overdensity is evaluated 
is decreased. Absorbers with T^ ya ~ 1 reside in regions where the galaxy number density is close to the 
cosmic mean on scales > 0.25 Mpc. We clearly detect two types of redshift space anisotropies. On scales 
< 200 kms -1 , or < 1 Mpc, the absorption is stronger along the line of sight than in the transverse direction. 
This "finger of God" effect may be due to redshift errors, but is probably dominated by gas motions within or 
very close to the halos. On the other hand, on scales of 1.4 - 2.0 Mpc the absorption is compressed along the 
line of sight (with > 3<r significance), an effect that we attribute to large-scale infall (i.e. the Kaiser effect). 
Subject headings: galaxies: formation — galaxies: halos — galaxies: high-redshift — intergalactic medium 
— quasars: absorption lines — large-scale structure of Universe 



1. INTRODUCTION 

Gas accretion and galactic winds are two of the most 
important and poorly understood ingredients of models for 
the formation and evolution of galaxies. One way to 
constrain how galaxies get their gas, and to learn about 
the extent of galactic feedback, is to study the intergalac- 
tic medium (IGM) in the galaxies' vicinity. The inter- 
face between galaxies and the IGM can be studied ei- 
ther in emission (e.g. i Bland & Tullvl [1981 jLehnert et al l 
1999b lRyan-Weberll2006l: iBorthakur et al.ll2010t ISteidel et all 
201 lb or in absorption against the co ntinuum of back- 
groun d objects such as QSOs (e.g.. lLanzetta & Bowed 
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Emission from intergalactic gas is very faint and obser- 
vations are currently mostly limited to low redshifts. At 
z ss 2.7 Lya emission was recently seen out to ~ 80 kpc from 
star-fo rming galaxies in a stacking analysis by ISteidel et al.1 
(120111) . and the origin of this light seems to be radiation of the 
central object scattered by galactic halo gas. However, these 
observations are limited to the immediate vicinity of galaxies, 
i.e. s» 100 kpc, and are currently feasible only for the H I Lya 
transition. On the other hand, studying this gas in absorption 
is viable at all redshifts as long as there are sufficiently bright 
background objects. Most importantly, absorption studies are 
sensitive to gas with several orders of magnitude lower den- 
sity than emission studies. 

The background sources used for absorption line probes of 
the IGM have traditionally been QSOs, but one may also use 
background galaxies or gamma ray bursts. While the surface 
density of background galaxies is much higher than that of 
QSOs, the quality of the individual spectra is much lower, 
since at the redshifts discussed in this paper the typical back- 
ground galaxy is > 10 3 times fainter than the brightest QSOs. 
Studies using galaxies as background sources are therefore 
confined to analyzing strong absorption lines and generally 
require stacking many lines of sight. On the other hand, QSOs 
sufficiently bright for high-resolution, high S/N spectroscopy 
using 8m-class telescopes are exceedingly rare, but the infor- 
mation obtained from a single line of sight is of exceptional 
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quality, well beyond what could be obtained with even the 
very brightest galaxy at comparable redshift. We note also 
that galaxies and QSOs do not provide identical information, 
as the former are much more extended than the latter. This 
is particularly relevant for metal lines, which often arise in 
absorbers with sizes that are c omparable or small er than the 
half-light radii of galaxies (e.g. lSchave et al.|[2007l) . 

In this paper we focus on studying the IGM near star- 
forming galaxies at 2.0 < z < 2.8 using absorption spec- 
tra of background QSOs. At present we focus on H I Lya 
in the vicinity of galaxies, while a future paper will study the 
relation between metals and galaxies. Star-forming galaxies 
in this redshift range can be d etected very efficiently b ased 
on their rest-frame UV colors dSteidel et alj|2003l I2004T) and 
the same redshift range is ideal for studying many astrophysi- 
cally interesting lines that lie in the rest-frame UV part of the 
spectrum (e.g. Lya at 1215.67 A, CIV AA 1548.19,1550.77 
A, OVI AA103 1.93, 1037.616 A, where the CIV and OVI lines 
are doublets). At these redshifts the Lya forest is not as sat- 
urated as at z > 4, and at the same time the absorption sys- 
tems are not as rare as they are at low redshifts. In addition, 
this redshift range is exceptionally well-suited for studying 
the galaxy-IGM interface given that t his is when the univ ersal 
star formation rate was at its peak (e.g. lReddy et al.l2008h . and 
hence we expect the interaction between galaxies and their 
surroundings to be most vigorous. 

The larg est QSO-galaxy surveys at hig h redshift are 
those f rom TAdelberger etafl d2003l [20051) and lCrighton etalj 
(1201 ll) . and here we will briefly describe the former given 
that it is the most comparable in terms of the galaxy sampling 
density and data quality to the study presented in this paper. 
Adelb erger et alj d2003l) studied the IGM close to 431 Lyman 
Break Galaxies (LBGs) at z « 3 - 4 in 8 QSO fields. They 
found enhanced Lya absorption within ss 1 — 6 h^ 1 comoving 
Mpc (ss 0.3— 2.1 proper Mpc) from star-forming galaxies. On 
the other hand, they found that the region within < 0.5 h~ x 
comoving Mpc (sa 0.2 proper Mpc) contains less neutral hy- 
drogen than the global av erage. This last result wa s, however, 
based on only 3 galaxies. Adelberg er et al.1 d2005) studied the 
IGM at 1.8 < z < 3.3 with an enlarged sample: 23 QSOs in 
12 fields containing 1044 galaxies. They confirmed the earlier 
result of enhanced absorption within < 7h~ l comoving Mpc 
(« 2.4 proper Mpc) and found that even though most galaxies 
show enhanced absorption within < 0.5 hr 1 comoving Mpc, 
a third of their galaxies did not have significant associated 
Lya absorption. 

In comparison with [AdelbergeritiD d200l |2005) we use 
uniformly excellent QSO spectra taken with the HIRES 
echelle spectrograph, cove ring from ^3 100 A to at least the 
QSO's CIV emission line. lAdelberger et alj (120031) also used 
HIRES spectra, but at z > 3 the surface density of galaxies 
with R s < 25.5 is ~ 4—5 times smaller than at z = 2.4, 
and the QSO spectra did not cover higher order Lyman lines, 
which are impor tant for the recovery of o ptical depths in satu- 
rated Lya lines. Adelber ger et alj (12005) used HIRES spectra 
as well as lower-resolution spectra of fainter QSOs obtained 
using LRIS and ESI. 

This paper is organized as follows. We describe our data 
sample in Section [2] In Section [3] we discuss the so-called 
pixel optical depth method that we use to analyze the QSO 
spectra. The distribution of Lya absorption as a function of 
transverse and line of sight (LOS) separation from galaxies 
is presented in Section |4l while the distribution of galaxies 



around absorbers is described in Section [5] Finally, we con- 
clude in Section[6] 

Throughout this work we us e fl m = 0.258, = 0.742, 
Q h = 0.0418, and h = 0.719 dKomatsu et alJl2009h . When 
referring to distances in proper (comoving) units we denote 
them as pMpc (cMpc). 

2. DATA 

2.1. The Keck Baryonic Structure Survey 

The Keck Baryonic Structure Survey (KBSS; Steidel et 
al 2012) is a new survey which combines high precision 
studies of the IGM with targeted galaxy redshift surveys 
of the surrounding volumes, expressly designed to estab- 
lish the galaxy /IGM connection in the redshift range 1.8 < 
z < 3. The KBSS fields are centered on 15 background 
QSOs (see TableQ]) that are among the most luminous known 
(Lboi > 10 14 L Q ) in the redshift range 2.5 < z < 2.85. The 
QSO redshifts were chosen to maximize the information con- 
tent and redshift path sampled by their absorption line spectra. 
Toward that end, the QSO spectra used in the KBSS are of un- 
precedented quality, combining archival high resolution spec- 
tra from Keck/HIRES (and, in 3 cases, archival VLT/UVES 
spectra as well) with new Keck/HIRES observations to pro- 
duce the final co-added spectra summarized in Table Q] 

The mean flux in this sample of QSO spectra is 0.806. 
The formula for the effective Lya optical depth from 
ISchaye et alJ(|20 03) suggests that the mean flux at z — 2.36 is 
0.802(±0.008), which is consistent with our data, so we con- 
clude that the IGM probed by QSO sightlines in our sample is 
representative for the considered redshift. 

Wit hin each KBSS field, UV color se lection techniques 
dSteidel et alJl2003b lAdelberger et aT1l2004 were used to tune 
the galaxy redshift selection function so as to cover the same 
optimal range of redshifts probed by the QSO spectra. The 
spectroscopic follow-up using Keck/LRIS was carried out 
over relatively small solid angles surrounding the QSO sight- 
lines, but typically 8 to 10 slit masks with nearly identical 
footprint were obtained in each field, leading to a high level 
of spectroscopic completeness and a very dense sampling of 
the survey volumes surrounding the QSO sightlines. The full 
KBSS galaxy sample contains 2188 spectroscopically iden- 
tified galaxies in the redshift range 1.5 < z < 3.5, with 
(z) = 2.36 ± 0.42. The spectroscopic sample was limited 
to those with apparent magnitude 1Z < 25.5, where galaxies 
in the range 23 < 1Z < 24.5 and those within 1 arcminute of 
a QSO sightline were given highest priority in designing slit 
masks. 

The effective survey area is 720 arcmin 2 (i.e. 0.2 square 
degrees), and the average surface density of galaxies with 
spectroscopic redshifts is 3.1 arcmin -2 (within 2.5 pMpc 
from QSO sightlines). 

In this paper, we focus on a subset of 679 galaxies that sat- 
isfy the following criteria for redshift and projected distance 
from the relevant QSO sightline: 

1) their redshifts are within the Lya forest range, defined 
as: 

. nAlv/3 . 3000kms _1 
(1 + zqso)t 1 < z < zqso-(1 + 2qso) > 

ALya C 

(1) 

where zqso is the QSO redshift, and AL yQ and X-^ y p 
are the rest-frame wavelengths of the hydrogen Lya 
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Table 1 

QSO fields. 



QSO a 


z b 


Gab € 






A min f [A] 


A ma x g [A] S/N H I h 


Afi' 






Ngal J 


Q0100+13 


2.7210 


16.6 


2.196 


2.654 


3125 


8595 


67 


5.6' 


X 


7.6' 


35 


Q0105+1619 


2.6518 


16.9 


2.102 


2.603 


3225 


6041 


112 


5.4' 


X 


7.4' 


59 


Q0142-09 


2.7434 


16.9 


2.210 


2.671 


3100 


6156 


65 


5.4' 


X 


7.4' 


45 


Q020/-U03 


2.8726 


16.7 


2.264 


2.848 


3100 


9220 


71 


5.4 


X 


7.0' 


38 


Q0449-1645 


2.6840 


17.0 


2.094 


2.651 


3150 


5966 


67 


5.0' 


X 


6.5' 


51 


Q0821+3107 


2.6156 


17.3 


2.150 


2.584 


3235 


5967 


42 


5.4' 


X 


7.4' 


32 


Q 1009+29 


2.6520 


16.0 


2.095 


2.593 


3160 


7032 


87 


5.2' 


X 


7.2' 


29 


Q1217+499 


2.7040 


17.1 


2.122 


2.680 


3075 


7000 


64 


5.1' 


X 


6.9' 


37 


Q 1442+2931 


2.6603 


17.0 


2.078 


2.638 


3155 


6150 


83 


5.4' 


X 


7.5' 


44 


Q1549+1933 


2.8443 


16.3 


2.238 


2.814 


3160 


9762 


164 


5.2' 


X 


7.1' 


44 


HS 1603+3820 


2.5510 


15.9 


1.974 


2.454 


3185 


9762 


87 


5.4' 


X 


7.2' 


41 


Q1623-KP77 


2.5353 


17.4 


1.983 


2.505 


3125 


6075 


43 


16.1' 


X 


L1.6' 


54 


Q 1700+64 


2.7513 


16.1 


2.168 


2.709 


3145 


9981 


92 


11.5' 


X 


L1.0' 


64 


Q2206-199 


2.5730 


17.5 


2.009 


2.541 


3047 


10088 


73 


5.4' 


X 


7.5' 


46 


Q2343+12 


2.5730 


17.0 


2.012 


2.546 


3160 


10087 


57 


22.5' 


X 


8.5' 


60 



a QSO name 
b QSO redshift 

c QSO AB magnitude (Sloan g' band) 
^ Minimum galaxy redshift 
e Maximum galaxy redshift 

Minimum wavelength covered 
g Maximum wavelength covered 
k Median S/N per pixel in the analyzed Lya region 
1 Spectroscopically observed area 
J Number of galaxies with spectroscopic redshifts 

(1215.67A)andLy/3 (1025.72A) lines, respectively (for 
a discussion of the limits in this expression, see {32. 2k 

2) they are within 2 pMpc (~ 4' or ~ hhT x cMpc at z = 
2.4), the transverse distance to which there is significant 
coverage in all 15 KBSS fields. 

Figure 1 shows the number of galaxies as a function of their 
(proper) distance from the QSO sightlines. 

We note that 3 of the 15 KBSS fields (1623+268, 1700+64, 
2343+12) were also included in the study by Adelberger et 
al (2005). However, since 2005 the data have been increased 
in both quantity and quality (for both the QSO spectra and 
the galaxy surveys) in all 3 of these fields. Further details on 
the KBSS survey and its data products will be described by 
Steideletal(2012). 

The smallest impact parameter in the present sample is 55 
pkpc, with 29, 106, and 267 galaxies having impact parame- 
ters smaller than 200, 500, and 1000 pkpc, respectively. 

2.1.1. Redshifts 

The redshifts of the vast majority (608 out of 679) of the 
galaxies in our sample are derived from interstellar absorp- 
tion lines and/or the Lya emission line. These lines were 
measured from low -resolution (FWHM m 370 km s _1 ) multi- 
slit spectra taken between 2000 and 2010 with LRIS-B on the 
Keck I and II telescopes. However, ideally one would want to 
measure the galaxy redshifts from the nebular emission lines 
([O II] A3727, Ha, H/3, [O III] AA4959, 5007) since those 
originate in stellar H II regions and are mor e likely to corre- 
spond to the galaxies' systemic redshifts. lErb et al.1 (|2006b) 
have used NIRSPEC, a near-IR instrument on Keck II, to ob- 
tain higher resolution (FWHM « 240 kms -1 ) spectra than 
achieved by LRIS-B and have used these to measure nebular 
redshifts for 110 galaxies. Our sample contains 71 of those 
galaxies and we take their systemic redshifts to be equal to 
the nebular redshifts, i.e. z ga i = z nc b- Marginally resolved 
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Figure 1. Number of galaxies as a function of proper distance from the line 
of sight to the background QSO. We do not use galaxies with impact param- 
eters larger than 2 pMpc since such galaxies were only targeted for a fraction 
of our fields. The distance bins are chosen to match the bins that are used for 
most of the figures in this paper (bins are separated by 0.15 dex). 



lines have a w 160 kms 1 for LRIS-B and a ss 100 kms 1 
for NIRSPEC. 

For the galaxies without near-IR observations we esti- 
mate the systemic redshifts using the empirical relations of 
iRakic et ail (1201 11) . Using the same data as analyzed here, 
they calibrated galaxy redshifts measured from rest-frame UV 
lines (Lyman-a emission, z-^ ya , and interstellar absorption, 
£ism) by utilizing the fact that the mean Lya absorption pro- 
files around the galaxies, as seen in spectra of background 
QSOs, must be symmetric with respect to the true galaxy red- 
shifts if the galaxies are oriented randomly with respect to the 
lines of sight to the background objects. The following values 
represent their best fits to the data: 

a) for galaxies without available interstellar absorption 
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Figure 2. Redshift distribution of QSOs (vertical lines) and galaxies that 
are used in our study, i.e. of galaxies that are in the Lyman-a: forest redshift 
ranges of QSOs in their fields and that have impact parameters < 2 pMpc. 
The black histogram shows the redshifts of all galaxies (679 objects in to- 
tal), and the grey histogram just those with redshifts measured from near-IR 
spectra (7 1 galaxies). 



lines (70 objects): 

— ^Lyo 



295±|| kms~ 



(2) 



b) for galaxies without detected Lya emission lines (346 
objects): 



^gai = Zism + 145±3" kms 1 



(3) 



c) for galaxies for which both interstellar absorption and 
Lya emission are detected (263 objects) we use the 
arithmetic mean of the above expressions. 

These offsets yield redshift estimates free o f velocity sys- 
tematics dRakic et al.H201 U iRudie et al.ll2012l) . Repeated ob- 
servations of the same galaxies suggest that the typical mea- 
surement uncertainties are s» 60 kms -1 for z nc b- Compari- 
son of redshifts inferred from rest-frame UV lines with those 
from nebular lines for the subset of 89 galaxies that have 
been observed in the near IR, shows that the rest-frame UV 
infer red redshifts have a scatter of sa 130 km s -1 . In addi- 
tion, IRudie et ail (120121) concluded that redshift errors in the 
KBSS sample are < 160 km s -1 based on the velocity struc- 
ture of the strongest H I absorbers around galaxy positions 
(first panel of their Figure 18), and lTrainor & Steidell (12012b 
reached the same conclusion based on the small observed ve- 
locity dispersion of galaxies with respect to QSOs. 

Figure|2]shows a histogram of the galaxy redshifts, both for 
the full sample and for the subsample with redshifts measured 
from near-IR lines. The median redshifts of the full and near- 
IR samples are 2.36 and 2.29, respectively. The mean and 
standard deviations of the redshift distributions for the two 
samples are 2.36 ± 0.17 and 2.28 ± 0.13, respectively. 

2.1.2. Physical properties 

These UV-selected star-forming ga laxies have stella r 
masses (logi O M*/M ) = 10.08 ±0.51 (iShaolev et al.12 005). 
The typical star formation rates (SFRs) are ~ 30 M Q yr^ 1 , 
where the SFRs of individual objects vary from sa 7 to sa 200 
Mq yr _1 , and the mean SFR surface density is (Esfr) = 2.9 
Mq yr _1 kpc -2 (lErb et al.l 12006 ) . The se stellar mass and 
SFR estimates assume a ChabrieJ([200! IMF. The galaxies 
show a correlation between their stellar mass and metallicity, 



but the relation is offset by 0.3 dex as compared to the lo- 
cal relation, with the s ame stellar mass galaxies having lower 
metallicity at z sa 2.4 (Erb et al. 2006c). Typical metallicities 
range from sa 0.3 Z Q for galaxies with (M*) = 2.7 x 10 9 M Q 
to sa Z Q for galaxies with ( M») = 1 x 10 11 M Q . 

As discussed in section 12.1.11 ISM absorption lines are 
almost always blue-shifted with respect to the galaxy sys- 
temic redshift, and the Lya emission line is always redshifted. 
These observed velocity offsets suggest that galaxy-scale out- 
flows, with velocities of hundreds of km s _1 , are the norm in 
th ese star-forming g a laxies. 

iTrainor & Steidell d2012l) use the MultiDark simulation 
(Prada et all 1201 lh 7 together with a clustering analysis, to 
connect galaxies from KBSS to dark matter halos. They 
find that this type of galaxy resides in halos with masses 
above 10 117 Mq, with a median halo mass of ~ 10 119 M^ 
(similar conclusions were reached by e.g. Adelberg er et al.l 
2005b; IConrov et al.l 120081) . The corresponding virial radii 
are sa 75 and sa 87 pkpc, respectively, with circular veloc- 
ities sa 171 km s -1 and sa 198 km s -1 . 

2.2. QSO spectra 

The typical resolution of the QSO spectra is R sa 36, 000, 
and they were rebinned to pixels of 2.8 km s -1 . The spec- 
tra were reduced using T. Barlow's MAKEE package and the 
continua were normalized using low-order spline fits. Further 
details about the QSO observations will be given in Steidel 
et al. (in preparation). The QSO redshift distribution can be 
seen in Figure [2] and a summary of the properties of the final 
spectra is given in Table [T] 

The redshift range that we consider when studying Lya ab- 
sorption in the spectrum of a QSO at redshift zqso is given by 
equation (HJ. The lower limit ensures that only Lya redwards 
of the QSO's Ly/3 emission line is considered, thus avoiding 
any confusion with the Ly/3 forest from gas at higher redshifts. 
The upper limit is set to avoid contamination of the Lya for- 
est by material associated with the QSO and to avoid the QSO 
proximity effect. We verified that excluding 5,000 rather than 
3,000 kms -1 gives nearly identical results. 

The median S/N in the Lya forest regions of the spectra 
ranges from 42 to 163 (Table[TJ. 

A number of QSO spectra (5 out of 15) contain damped 
Lyman-a systems (DLAs) and sub-DLAs within the consid- 
ered redshift range. We divided out the damping wings af- 
ter fitting the Voigt profiles (see IRudie et aD 120 121 for more 
details on this procedure). The saturated cores of the (sub- 
)DLAs were, however, flagged and excluded from the analy- 
sis, but this does not have a significant effect on our results. 

3. PIXEL OPTICAL DEPTH 

The goal of the pixel optical depth (POD) method is to use 
automatic algorithms (as opposed to manual fits) to find the 
best estimate of the optical depth of the H I Lya absorption 
line and those of various metal transitions as a function of 
redshift. Obtaining accurate pixel optical depths can be chal- 
lenging due to the presence of noise, errors in the continuum 
fit, contamination and saturation. 

It is more instructive to plot optical depths than normalized 
fluxes because the optical depth is proportional to the column 
density of the absorbing species, while the flux is exponen- 
tially sensitive to this density. Hence, a power-law column 
density profile translates into a power-law optical depth pro- 
file and a lognormal column density distribution becomes a 
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Figure 3. Lower panel: Recovered H I Lya pixel optical depth (black dots) as a function of redshift for the spectrum of Q1549+1933. The black dots at 
Tm = 10 -5 are pixels with normalized flux greater than unity that have their values set to r m ; n (see the text for details). The points at 10 4 show pixels that are 
saturated and that have insufficient available higher order lines for the recovery of their optical depths. The solid vertical line indicates the redshift of the QSO. 
The dashed vertical line, which is separated from the solid line by 3,000 kms , indicates the maximum redshift we consider for this spectrum. Upper panel: 
Impact parameters of galaxies in this field (red stars) as a function of spectroscopic redshift. 



lognormal optical depth distribution. The shapes of the pro- 
files and distributions common in nature are therefore pre- 
served, which is not the case if we work with the flux. Of 
course, if the spectral resolution is coarser than the typical 
line width, or if the wavelength coverage does not allow for 
the use of weaker components of multiplets to measure the 
optical depth of saturated lines, then the interpretation of the 
recovered optical depths is more complicated. Our spectral 
resolution is, however, sufficient to resolve the Lya lines and 
our wavelength coverage is sufficient to recover the optical 
depth of most saturated lines. 

Another advantage of optical depths is that their dynamic 
range is unconstrained, whereas the normalized flux is con- 
fined to vary between zero and one. The large dynamic range 
does imply that one must use median rather than mean statis- 
tics, because the mean optical depth merely reflects the high 
optical depth tail of the distribution. While this problem could 
also be solved by using the mean logarithm of the optical 
depth, that would not resolve another issue resulting from the 
large intrinsic dynamic range: the observed optical depth dis- 
tribution is cut off at the low absorption end by noise and at 
the high end by line saturation (and thus ultimately also by 
noise). However, errors in the tails of the distribution do not 
matter if we use median statistics (or other percentiles, pro- 
vided they stay away from the noise). 

The POD sta t istics method was introduce d by 
ICowie & Songailal d 19981) (see also ISongailal 119981) and 



further imp roved by Ellison et ail (120001) . ISchave et al.l 
( 2000b]), and lAguirre et alJ (120021) TlTie POD method used in 
this st udy is the one developed and tested b y Aguirr e et alJ 
(2002), and is identical to that of lEllison etaLl (120001) for the 
case of Lya. 

The H I Lya optical depth in each pixel is calculated from 
the normalized flux F(A), 



T LyQ (A) = -ln(F(A)). 



(4) 



Pixels with TL yQ < (which can occur in the presence of 
noise) are assigned an optical depth of r m i n = 10~ 5 , which is 
smaller than any recovered optical depth. 



Pixels that have F(X) < N a a\ are considered saturated, 
where a\ is the rms noise amplitude at the given pixel and 
N a is a parameter that we set to 3 (see Aguirre, Schaye & 
Theuns, 2002 for more details). One advantage of the POD 
method is that it is easy to recover a good estimate of the Lya 
optical depth in these saturated pixels by using the available 
higher order Lyman lines. The recovered Lya optical depth is 
then given by 

r Lya = m i n { r Lyra/LyaALya//LynALyn}, (5) 

where /L yn is the oscillator strength of the nth order Lyman 
line and AL yn is its rest wavelength (n = 1 corresponds to 
a, n = 2 to f3, etc.). Taking the minimum optical depth 
(Equation |5]l minimizes the effect of contamination by other 
lines. Higher order lines used for Lya optical depth recov- 
ery are those that lie in the wavelength range of the spectrum, 
and for which N a a n < F(\-L yn ) < 1 — N a cr n , where a n is 
the noise at AL yn - If none of the available higher order lines 
satisfy this criterion, or if none of the higher order lines are 
available, then the pixel is set to a high optical depth (we use 
Tmax = 10 4 ). An example of the recovered optical depth in 
one QSO spectrum is shown in Figure [3] together with the 
positions of galaxies in the same field. 

Setting pixels to r m j n and r max allows them to correctly 
influence the median and other percentiles. The actual values 
they are set to are unimportant as long as T m - m < r£° c Q < r max 
for the percentile of interest. In other words, as long as the 
Train or r max are on the same side of a given percentile as the 
true optical depth that they are replacing, their actual values 
are unimportant. 

We use the POD method because: a) It is fast, robust, and 
automatic, which means it can deal with large amounts of 
data, both observed and simulated; b) It makes it straight- 
forward to exploit the full dynamic range measurable from 
our spectra; c) The fact that the optical depth is directly pro- 
portional to the column density of n eutral hydrogen makes it 
easy to interpret the results (see §14.51 1. While the POD method 
has clear advantages, it is important to note that it does not 
replace the more traditional method of decomposing the ab- 
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sorption spectra into Voigt profile components. For example, 
in its simplest form, the POD method does not make direct 
use of information about the line widths, which are clearly of 
interest as they contain important information about the tem- 
perature and the small-scale velocity structure of the absorb- 
ing gas. An analysis of the KB SS data based on V oigt profile 
decompositions is presented in R udie et ail (12012b . 

4. LYa ABSORPTION NEAR GALAXIES 

After recovering the optical depths in each pixel of a QSO 
spectrum, we can build a map of the galaxies' average sur- 
roundings, as a function of the transverse distance from the 
line of sight (LOS), i.e. the impact parameter, and the veloc- 
ity separation from the galaxy along the LOS. We sometimes 
convert velocity separation into distance by assuming that it 
is du e to the Hub ble flow. However, as we will show in sec- 
tions [4J] and [4j2] this is a poor approximation at small veloc- 
ity differences from galaxies. We will refer to distances com- 
puted under the assumption of zero peculiar velocities (and 
zero redshift errors) as "Hubble distances". 

Different galaxies and QSO fields are combined, meaning 
that the median POD for a given impact parameter and ve- 
locity difference is estimated over all galaxies irrespective of 
which field they come from, without applying any weighting. 
Each galaxy therefore provides an array of pixels with varying 
velocity difference but fixed impact parameter. 

For reference, we note that 0.2, 2.1, and 9.9 percent of the 
pixels in our sample are at 3-D Hubble distances smaller than 
200, 500, and 1000 pkpc, respectively. 

4.1. 2-D Maps of the median Lya absorption 

The left panel of Figure [4] shows the logarithm of the me- 
dian TL yQ as a function of the transverse and the LOS separa- 
tions from galaxies. The distance bin size in this plot is 200 
by 200 pkpc (which corresponds to w 46 kms -1 along the 
LOS), and the image has been smoothed with a 2-D Gaus- 
sian with FWHM equal to the bin size. Note that we take 
a galaxy-centered approach in constructing this image. Each 
galaxy contributes a column of pixels whose position along 
the x-axis corresponds to the galaxy's impact parameter. A 
single pixel can be used multiple times: once for each galaxy 
whose separation from the pixel falls within the range plotted. 
This is the first publis hed 2-D absorption map a round galax- 
ies. S t udies by e.g. lAde lberger et al. (2003J); iRvan- Weber 
(2006); IWilmanetalJ (|2007|) and LShone et al. (2010) show 
cross-correlation maps of galaxies with Lya absorption sys- 
tems, which is not equivalent. The map shows a strong cor- 
relation between the Lya absorption strength and the distance 
to the galaxies. 

The right panel of Figure [4] shows the same data, after ran- 
domizing the galaxy redshifts. We randomize the galaxy red- 
shifts within the Lya forest region of each QSO spectrum 
while keeping their impact parameters unchanged in order to 
preserve the number of pixels per galaxy. In this way we can 
estimate the magnitude of the fluctuations in the absence of 
correlations between the locations of galaxies and absorbers. 
We can see that the signal is lost, which implies that the fea- 
tures seen in the left panel are real. Finally, we note that 
because a single galaxy contributes a full column of pixels 
at its impact parameter, bins along the LOS are somewhat 
correlated. On the other hand, bins in the transverse direc- 
tion are independent. In Appendix [B] we demonstrate that 
along the LOS the errors are significantly correlated for scales 
^lO^ms" 1 . 



In Figure|4]we kept the positive and negative velocity differ- 
ences between absorbers and galaxies separated, which gives 
insight into the amount of noise and sample variance. How- 
ever, given that the Universe is statistically isotropic^ (and our 
observations are consistent with this assumption), we can in- 
crease the S/N by considering only absolute velocity differ- 
ences. Figure [5] shows a map of gas around galaxies by tak- 
ing into account only absolute velocity separations. We use 
logarithmic axes, which brings out the small-scale anisotropy 
more clearly. 

The most prominent feature is the region of strongly en- 
hanced absorption that extends to ~ 10 2 pkpc in the trans- 
verse direction, but to ~ lpMpc (~ 200 km s -1 ) along the 
LOS. This redshift space distortion (called the "finger of God" 
effect in galaxy redshift surveys) could be due to redshift 
errors and/or peculiar velocities. A more subtle anisotropy 
is visible on large scales. In the transverse direction the 
correlation between absorption strength and galaxy position 
persists out to the maximum impact parameter we consider, 
2 pMpc, whereas it becomes very weak beyond « 1.5pMpc 
(w 300 km s -1 ) along the LOS, as is most clearly visible in 
Figure |4] If real, su ch a feature w ould imply infall of gas on 
large scales, i.e. the Kaisej (119871) effect. Other scenarios are 
unlikely, as the signal distortion along the LOS can only be 
caused by redshift errors and peculiar velocities. However, 
redshift errors would act to spread the signal along the LOS 
rather than compressing it, which leaves peculiar velocities as 
the only other possibility. We will examine the significance of 
these anisotropies in the next section. 

4.2. Redshift space distortions 

Figure|6]shows "cuts" along the first 9 rows and columns of 
the 2-D map shown in Figure|5] spanning 0-2 pMpc. The red 
circles and grey squares show the profiles along the transverse 
and LOS, respectively. As indicated in the insets, these cuts 
correspond, respectively, to the horizontal and vertical strips 
in Fig. [5] Observe that the transverse and LOS directions are 
identical for the n th data point in the ?i th panel, which corre- 
sponds to the intersection of the horizontal and vertical strips 
shown in the insets. In other words, where the horizontal and 
vertical strips meet, the data point is replicated as both a red 
and a black symbol. Note also that the n th red circle (black 
square) of the m th panel is identical to the m th black square 
(red circle) of the n th panel. The figure thus contains redun- 
dant information. For example, the LOS direction (i.e. black 
squares) in the 1 st panel shows all the 1 st data points appear- 
ing in the transverse direction (i.e. red circles) of the other 
panels. Similarly, the 2 nd panel shows all the 2 nd transverse 
data points, etc. 

As LOS separations have been computed under the assump- 
tion of pure Hubble flow, any significant difference between 
the red and black curves must be due to redshift space distor- 
tions (assuming the Universe is isotropic in a statistical sense). 
By comparing the two curves in each panel, we can there- 
fore identify redshift space distortions and assess their signif- 
icance. 

The error bars in Figure [6j as well as those in subsequent 

2 The Universe is denser at higher redshift and therefore we expect more 
absorption at positive velocity differences in Figuref4]than at negative veloc- 
ities, but this effect is negligible on the scales that we consider. The mean 
transmission varie s by m 2 X 10~ 3 over 10 3 kms — 1 at 2 = 2.4 (e.g. 
Schaye et al. 2003), which is much smaller than the excess absorption in the 
vicinity of galaxies. 



HI near galaxies at z « 2.4 




Figure 4. Le/f: Median H I Lyct absorption as a function of transverse and LOS distance from (z) fa 2.4 stai'-forming galaxies. The bin size is 200 pkpc and the 
images have been smoothed by a 2-D Gaussian with FWHM equal to the bin size. The lower limit of the color scale corresponds to the median optical depth of all 
pixels (logio = —1.27), while the upper limit is set to logio = 0.5 (very close to the maximum optical depth in this map, logio = 0.47). Absorption is clearly 
enhanced close to galaxies, out to at least 2 pMpc in the transverse direction, but only out to Rd 1.5 pMpc along the LOS. This anisotropy suggests large-scale 
infall of gas. On the other hand, on small scales the absorption declines more rapidly in the transverse direction than in the LOS direction. Right: Results after 
randomizing the galaxy redshifts while keeping their impact parameters fixed. The fact that the correlation does not vary systematically with distance indicates 
that the features in the left panel are real. 



Table 2 

Data from Figure|3] the median logioTL ya as a function of impact parameter (rows) and distance along the LOS (columns), with the lcr confidence 

interval. 



Distance [pMpc] 0.00-0.13 0.13-0.18 0.18-0.25 0.25-0.36 0.36-0.50 0.50-0.71 0.71-1.00 1.00-1.42 1.42-2.00 
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Figure 5. Similar to Fig. \4\ but we do not separate positive and negative 
velocity differences between absorbers and galaxies and we use logarithmic 
bins and axes (bins are separated by 0.15 dex). The map is smoothed by a 
2-D Gaussian with FWHM equal to the bin size. The color scale is saturated 
at the high optical depth end (at the smallest impact parameters the optical 
depth is logio ~ 1.15). 

plots, were computed by bootstrapping the galaxy sample 
1,000 times. That is, for each bootstrap realization we ran- 
domly select galaxies, where each galaxy could be selected 
multiple times, until the number of galaxies from the original 
set is reached. We calculate the results for each realization and 
the errors show the lcr confidence interval. As demonstrated 
in Appendix iBl along the LOS the errors are correlated for 
separations < 10 2 kms -1 (i.e. black squares in each panel 
are correlated on these scales), but not in the transverse direc- 
tion (i.e. red circles in each panel are independent). 

The number of galaxies per transverse bin depends slightly 
on the velocity difference, because galaxies separated by 
Aw kms -1 from the Lya forest region (as defined in sec- 
tion |Z2]i will still contribute pixels to bins with velocity sep- 
arations greater than Aw. From small to large impact param- 
eters, the number of galaxies contributing to the 1 st (9 th ) ve- 
locity bin is 14 (16), 8 (8), 1 1 (12), 22 (23), 47 (48), 58 (62), 
96 (99), 175 (180), 202 (211). Thus, the black squares in 
the first panel, as well as the first red circle in all panels, are 
based on 14 - 16 galaxies. Similarly, the black squares in the 
9 th panel, as well as the last red circle in all panels, are based 
on 202 - 21 1 galaxies. 

The first panel, which shows cuts near the axes of Figure|5] 
clearly shows that out to ~ 1 pMpc, which translates into 
< 200 km s -1 along the LOS (see the top x-axis), the ab- 
sorption is stronger along the LOS (black squares) than in the 
transverse direction (red circles). In panels 2-7, which cor- 
respond to strips offset by 0.13 - 1.00 pMpc from the axes in 
Figure [5] the smearing in the LOS direction manifests itself as 
enhanced absorption at small impact parameters relative to the 
signal at small LOS separations. The confidence level associ- 
ated with the detected discrepancy between the two directions, 
which we estimate as the fraction of bootstrap realizations in 
which the sign of the discrepancy is reversed, is at least 99% 
for each data point out to 1 pMpc (i.e. points 2-7 of the first 
panel). 

The differences on scales < 1 pMpc (< 233km s _1 ) can 
be explained by two effects. Firstly, we expect gas in and 
around galaxy halos to have peculiar velocities comparable 
to the circular velocity of the halos, i.e. ~ 200 kms -1 (and 
possibly significantly h igher for outflowing gas). Secondly, 
as discussed in Section 12.1.11 there are random errors in the 



redshift measurements of sa 130 km s -1 for LRIS redshifts, 
and ss 60kms _1 for the NIRSPEC subsample. These two 
effects smooth the signal in velocity space on the scale that 
is a result of the combination of these velocities. In Ap- 
pendix|A]we show that redshift errors may be able to account 
for the observed smoothing. However, this does not prove the 
observed elongation along the LOS is due to redshift errors 
rather than peculiar velocities. In fact, since the elongation is 
more extended than the typical redshift errors (~ 200 kms -1 
vs. w 130 km s _1 ), but similar to the expected circular veloc- 
ities (« 200 km s _1 ), it is likely that the finger of God effect 
is dominated by small-scale peculiar velocity gradients due to 
virial motions, infall, and/or outflows. This would not be sur- 
prising considering the observations of large velocity field s 
in the gas in star- forming galaxies (e.g. ISteidel et alj|2010b . 
iRudie et all (120 121 ) use the KBSS sample to study Lya absorp- 
tion near galaxies by decomposing the forest lines into Voigt 
profiles, and they also detect the finger of God effect at small 
impact parameters. After presenting a more detailed analysis 
of the impact of redshift errors, they conclude that redshift er- 
rors alone cannot account for the detected elongation of the 
absorption signal along the LOS, which strengthens our con- 
clusion. It is also worth noting that even if redshift errors were 
solely responsible for the finger of God effect, we can still 
safely conclude that the detected peculiar motions of Lya ab- 
sorbing gas near galaxies do not exceed ~ 200 km s — 1 , How- 
ever, this does not mean that such gas does not reach higher 
velocities very close to galaxies, as the smallest impact pa- 
rameter in the KBSS sample is m 55 pkpc. 

On the other hand, at distances > 1.42 pMpc the situation 
is reversed: the absorption is compressed in the LOS direc- 
tion. For impact parameters 1.42 - 2 pMpc and LOS separa- 
tions 0-0.71 pMpc (0- 165 kms -1 ; 9 th (i.e. last) red circles 
in panels 1-6) the absorption is stronger than for impact pa- 
rameters 0- 0.71 pMpc and LOS separations 1.42 - 2 pMpc 
(9 th black square in panels 1-6). The same information is 
collected in the last panel, which corresponds to strips sepa- 
rated by 1.42-2.00 pMpc from the axes. The first six black 
squares are higher than the corresponding red circles, which 
implies that the absorption is enhanced along the LOS rela- 
tive to the direction transverse to the LOS. Observe that this 
enhancement is absent from all other panels. The confidence 
level with which this discrepancy is detected is, from small to 
large scales, 99.4%, 77.9%, 83.2%, 99.8%, 91.6% and 82.6% 
for points 1-6, respectively. As mentioned above, these con- 
fidence levels represent the fraction of bootstrap realizations 
in which the sign of the discrepancy is the same as for the 
original sample. 

In order to estimate the significance of the compression 
along the LOS, we combine different measurements of the 
difference between pairs of points. For each pair of points ; 
(points 1-6 in the last panel of Figure [6j, we first compute 
the difference between median optical depths, di = a,; — 
where as and 6, are the observed values, and the associ- 



ated error 



S a,i + S b,V 



where s„ 



and i are er- 
rors on measurements ojj and hi. After that we compute 
the weighted average difference for the 6 pairs of points, 

( d ) = Ei=i(4/ S i)/ YnZii 1 /^ )» and the corresponding er- 
ror, s 2 = 1/ X)i=i(l/ S i )■ The significance is then (d)/s (as- 
suming Gaussian errors). This procedure suggests that the 
compression along the LOS is detected at 3.5<7. 
This compression along the LOS can be explained by large- 
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Figure 6. Cuts through the unsmoothed 2-D map from Fig. [5] where red circles show median profiles in the transverse direction (i.e. parallel to the x-axis of 
Fig. ID, and grey squares show cuts along the LOS (i.e. parallel to the j/-axis of Fig. [5). The range of LOS separations and impact parameters included in the 
strips along the y- resp. x-axis of Fig.|5]is indicated in each panel. Distances along the LOS have been computed from the corresponding velocity interval under 
the assumption of pure Hubble flow. Hence, differences between the red and black curves indicate redshift space distortions. The dashed horizontal line shows 
the median Lya optical depth of all the pixels in the spectra. With the exception of the last red circles and the corresponding grey squares, which correspond to 
1.42 - 2.00 pMpc, the differences between the red and black curves in the first 8 panels indicate that the signal has been smoothed in the LOS direction, probably 
as a result of redshift errors and, most importantly, small-scale peculiar velocities. On the other hand, the differences revealed by the most distant points in the 
first six panels or, equivalently (see text), by the first six points in the last panel, imply compression along the LOS. This compression indicates the presence of 
large-scale infall onto the galaxies. Data plotted in this figure are tabulated in Table|5] 



scale infall, i.e. the Kaiser effect. As the absorbing gas must 
be cool, i.e. T ~ 10 4 K <C T vir , to be visible in HI, this 
form of gas acc retion could be called "cold accretion" (e.g. 
iKeres et afll2005l) . although we note that this term is most of- 
ten used in the context of cold streams within the virial radii 
of halos hosting galaxies. We conclude that we have an unam- 
biguous and highly significant detection of cool gas falling to- 
wards star-forming galaxies at z ps 2.4. The large-scale infall 
of gas onto haloes and its impact on the abso rption signal has 
been a topic of several theoretical studies (e.g. lKollmeier et al.l 
l2003HBarkanal l2004: Kim & Crofjl2008|) . and is also investi- 
gated in Rakic et al. (in preparation) where we will compare 
the observational results with cosmological hydrodynamical 
simulations. 

The dashed, horizontal lines in Figure|6]indicate the median 
Lya optical depth of all pixels in the Lya forest regions of the 
QSO spectra. In the transverse direction we do not probe suf- 



ficiently large distances to see the signal disappear: for all but 
the last panel the red curves stay above the dashed lines out 
to impact parameters of 2 pMpc. In the LOS direction (black 
curves) we do see convergence for separations > 3 pMpc. The 
last black square in the first panel is an outlier, but note that it 
is based on only 16 galaxies, whereas the same points in pan- 
els 5-9 are based on 49-217 galaxies. Indeed, according to the 
redshift randomization method described in the next section, 
the last black square in the first panel is only a 1.8<r outlier, 
whereas the last red circles of panels 1-6 (or, equivalently, the 
first 6 black squares in the last panel) represent detections of 
excess absorption with significance varying between 2.6<r and 

> 3(7. 

The median log(ioTL ya ) and ler confidence intervals from 
Figure [5] are tabulated in Table [2] 

4.3. Lya absorption as a function of 3-D Hubble distance 
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Figure 7. Median logioTXya as a function of 3-D Hubble distance from 
galaxies. Distance bins are separated by 0.15 dex. The dashed lines show 
the 15.9% and 84.1% percentiles, i.e. the la scatter of the PODs around the 
median. Note that the errors are correlated over scales < 10 2 kms -1 (see 
AppendixlBl. The horizontal dashed line shows the median of all pixels in the 
spectra. The grey shaded region shows the Icr detection threshold. It shows 
the la confidence interval for the median(logioTX yce ) that we obtain after 
randomizing the redshifts. The dotted line shows the number of galaxies in 
each distance bin (right y-axis). The blue curve shows the best-fit power-law: 
logiOTLya = 0.32d^' 92 — 1.27, where d^u is the 3-D Hubble distance. 

Figure[7]shows the median Lya optical depth in radial bins 
around the galaxy positions, where we assumed that velocity 
differences between absorbers and galaxies are entirely due to 
the Hubble flow. We emphasize that Figures[5]and[6]show that 
this is not a good approximation, particularly for distances 
< 1 pMpc. It does, however, provide us with a compact way 
to present a lot of information. 

The 3-D Hubble distance, as we call it, is therefore just 
g?3d = \/b 2 + (H(z)Av) 2 , where b is the galaxy's impact pa- 
rameter, H(z) is the Hubble parameter, and Av is the velocity 
separation betw een an absorber and the galaxy. As mentioned 
in Section 12.11 we use only galaxies with impact parameters 
smaller than 2 pMpc, even when making Figure [7] Hence, 
distances > 2 pMpc reflect mostly LOS separations. 

The horizontal dashed line shows the median level of ab- 
sorption in the Lya forest pixels. The significance of the 
excess absorption can be estimated by comparing the error 
bars, which indicate the la confidence intervals determined 
by bootstrap resampling the galaxies, to the difference be- 
tween the data points and the horizontal dashed line. More 
precisely, we can estimate the confidence level associated 
with the detection of excess absorption as 1 — 2/b.i ow , where 
/b.iow is the fraction of 1,000 bootstrap realizations for which 
the data point falls below the dashed line. This method indi- 
cates that within 2.8 pMpc excess absorption is detected with 
greater than 99.7% significance (i.e. > 3er). For 2.8 - 4.0 
pMpc the significance is 87% (i.e. 1.5a), while the absorption 
is consistent with random beyond 4 pMpc. 

The dotted curve and the right y-axis show the number of 
galaxies contributing to each bin. Since the inner few bins 
contain only a few tens of galaxies each (14 for the first bin), 
the bootstrap errors may not be reliable for these bins. The 



significance of the excess of absorption can be estimated more 
robustly by making use of the fact that each QSO spectrum 
provides many independent spectral regions at the impact pa- 
rameter of each galaxy. We can do this by comparing the ex- 
cess absorption to the grey region, which indicates the la de- 
tection threshold and which was determined by re-measuring 
the median logioTLya a ft er randomizing the galaxy redshifts 
(while keeping the impact parameters fixed). The grey shaded 
region shows the la confidence interval obtained after doing 
this 1,000 times. For each distance bin, the confidence level of 
the detection is then given by 1 — 2/ Z) hi g h> where / z ,high is the 
fraction of realizations resulting in a median optical depth that 
is higher than actually observed. In agreement with the errors 
estimated by bootstrap resampling the galaxies, we find that 
the significance is > 99.7% within 2.8 pMpc and that there is 
no evidence for excess absorption beyond 4 pMpc. For 2.8- 
4.0 pMpc the significance of the detection is, however, larger 
than before: 99.2% (i.e. 2.7cr). We conclude that the absorp- 
tion is significantly enhanced out to at least 3 pMpc proper, 
which is 7/i _1 cMpc. 
The fact that the absorption is enhanced out to several pMpc 

I I I I 

is in good agreement with Adelberger et al. (2005), who mea- 
sured the mean flux as a function of 3-D Hubble distan ce. 
However, the profile measured by Adelberg er et al.l ((2005) is 
much flatter. Converting their data into optical depths, they 
measure log 10 TL yQ w —0.1 in their innermost bin, which ex- 
tends to about 200 pkpc. This is about an order of magnitude 
lower than our median recovered optical depth at this distance. 
Conversely, at large distances their mean flux asymptotes to 
0.765, or log 10 T^ ya ss —0.57, which is much higher than 
our asymptotic median optical depth of log 10 T^ ya ps —1.27, 
even though we measure a similar mean flux of 0.806, or 
log 10 TL VQ ~ —0.67. Thus, our dynami c range is about 
two or ders of magnitude larger than that of Adelberger et al. 
(2005). This difference arises because we use median opti- 
cal depth rather than mean flux statistics and because we use 
higher order Lyman lines to recover the optical depth in satu- 
rated lines. 

The dashed curves show the 15.9% and 84.1% percentiles, 
indicating the la scatter in the PODs (which is obviously 
much larger than the error in the median). It is important to 
note that, except on the smallest scales (< 200 pkpc), the scat- 
ter is similar to or larger than the median excess absorption. 
Hence, there will be a wide range of PODs for all separations 
probed here. 

Finally, the blue line shows the best-fit power-law through 
the data points, 

median(log 10 T LyQ ) = (O^iO-OS)^' 92 ^' 17 - 1.27, (6) 

where we required the fit to asymptote to the median of all 
pixels (horizontal, dashed line in Figure 0. 

4.3.1. Testing the robustness 

In this section we will show that the results we presented 
(2-D and 3-D absorption profiles) are robust to changes in the 
S/N ratio of the QSO spectra, to the omission of NIRSPEC or 
non-NIRSPEC redshifts, to the exact redshift calibration, and, 
for optical depths <C 10, to the use of higher order Lyman 
lines. We have chosen to demonstrate this using the plots of 
median absorption versus 3-D Hubble distance, because these 
offer a compact summary of the data. 

In the left panel of Figure [8] we compare the median Lya 
absorption as a function of 3-D Hubble distance for the lower 



3-D Hubbl 

100 



e Velocity [km/s] 
00 1000 







: r j : : : : t ■ : 






| "Lower S/N 






. . j Higher S/N" 




1 














■ ■ * « 







HI near galaxies at z « 2.4 

3-D Hubble Velocity [km/s] 
100 1000 

3 



11 



3-D Hubble 
100 



Velocity [km/sl 
1001 



100 ' 



■ 10 



NIRSPEC 
No NIRSPEC 




100 ■ 



■ 10 



;With HO (default) 
No HO 



100 ' 



■ 10 



0.1 1.0 

3-D Hubble Distance [pMpc] 



0.1 1.0 

3-D Hubble Distance [pMpc] 



0.1 1.0 

3-D Hubble Distance [pMpc] 



Figure 8. Similar to Figure [7j but: the left panel shows the differences arising when using only higher or lower S/N QSO spectra; the middle panel shows the 
results for the galaxy subsample with near-IR redshifts, and for the whole sample when not using redshifts measured from near-IR lines; the right panel shows 
the effect of relying solely on Ly« and neglecting higher order transitions, which otherwise allow the recovery of the optical depth in saturated Lya pixels. In all 
panels the black points have been slightly offset for clarity and the grey curve shows the result for the default sample and method. 



and higher S/N subsamples of the QSO spectra (with median 
S/N ratios of « 65 and ss 85 respectively). It appears that 
better data yields slightly more absorption at 3-D Hubble dis- 
tances of ~ 0.3 pMpc, but the differences are not significant. 

The middle panel of Figure [8] compares the subset of 71 
galaxies for which redshifts have been measured from neb- 
ular emission lines using the NIRSPEC instrument (red cir- 
cles) with the default sample (grey curve) as well as with 
the result obtained when we ignore the NIRSPEC redshifts 
and instead use Lyct emission and/or interstellar absorption 
redshifts measured from LRIS spe ctra fo r all galaxies (black 
squares). As discussed in Sections [2.1. H and appendix lAl the 
redshifts estimated from the NIRSPEC spectra have errors of 
At; 60kms~ 1 , while the redshifts estimated from LRIS 
spectra typically have Av w 130 km s -1 . In Figure[8]we see 
that the signal appears to drop slightly more steeply for the 
NIRSPEC subsample, as would be expected given the smaller 
redshift errors, but both the NIRSPEC and pure-LRIS samples 
are consistent with the default sample. 

We note that for Lyct emission and interstellar absorp- 
tion we also tr i ed us ing the redshift calibra tions from 
lAdelberger et alj d2005l) a nd ISteidel et all (120101) instea d of 
the one from lRakic et all d201 lb . For lRakic et ail (1201 ll) the 
signal tends to be slightly stronger and the bootstrap errors 
slightly smaller, but the differences are small compared with 
the errors (not shown). 

One of the advantages of the POD method is the possibil- 
ity to recover the optical depth in the saturated Lya pixels by 
using higher order Lyman lines. The right panel of Figure [8] 
shows the effect of omitting this feature of the POD method. 
The two curves are nearly identical except for the first two 
bins, — 0.18 pMpc, where the median optical depths in- 
crease from ^10 when we make use of higher order lines to 
10 4 when we do not. The latter value is not meaningful as it 
is the optical depth that we assign to pixels for which satura- 
tion prevents recovery of the optical depth (see $3). Without 
higher order lines, we cannot constrain the flux to be much 
smaller than the S/N ratio, which in our case corresponds to 
optical depths of about 4-5. 

Hence, measuring the median optical depth in the circum- 
galactic region requires the use of higher order lines, but the 
recovery of the optical depth in saturated pixels appears to be 
unimportant at large distances. This is, however, only true if 
we restrict ourselves to median statistics. As we will see in 



Sections [4. 6. 21 and !4. 6. 31 higher order lines are also crucial at 
large distances if we are interested in the PDF of pixel optical 
depths. 

4.4. Lya absorption as a function of transverse distance 

Figure [5] shows that absorption signal is smeared out over 
~ 1 pMpc (i.e. ~ 200 kms -1 ) in the LOS direction. Indeed, 
for all panels of Figure|6]the first 6 bins along the LOS (black 
squares), which correspond to velocities < 165 kms -1 , are 
consistent with each other at the la level. This velocity is 
slightly larger than the expected redshift errors and similar to 
the expected circular velocities of the halos hosting our galax- 
ies. Because there is no evidence for structure in the velocity 
direction on scales < 165 kms -1 and because the errors are 
strongly correlated for smaller velocity differences (see Ap- 
pendix |E), it makes sense to group these first 6 LOS bins to- 
gether and measure the absorption as a function of transverse 
distance. The result is shown as the blue squares in Figure|9] 
(all data from this figure is tabulated in Tabled, which shows 
the 3-D Hubble distance results for comparison (grey circles). 

Estimating the errors by bootstrapping galaxies, we find 
that excess absorption is detected with > 2a confidence over 
the full range of impact parameters. Using the more robust 
method of randomizing galaxy redshifts, we find that the sig- 
nificance is at least 2a for all but the second point (0.13— 
0.18 pMpc), which is based on only 8 galaxies and for which 
the significance is only 1.6cr. 

As expected, the Hubble and transverse results con- 
verge at large distances, where redshift space distortions are 
smal0. They also agree at very small distances (first bin; 
< 0.13 pMpc), because a small 3-D Hubble distance implies 
that the transverse distance must also be small (note that the 
reverse is not true since galaxies with small impact parame- 
ters contribute pixels with large LOS separations). On inter- 
mediate scales, however, the two measures of distance yield 
significantly different results, with the absorption at fixed 3- 
D Hubble distance being stronger than that at the same fixed 
transverse distance. In particular, while the transverse direc- 
tion shows a rapid fall off at 6 ~ 0.1 pMpc followed by a 
constant excess absorption out to 2 pMpc, the absorption de- 

3 Note that in the last bin the absorption is slightly stronger in the trans- 
verse direction. This is a weaker version of the distortion that we attributed to 
the Kaiser effect when we compared the transverse and LOS directions (see 
Fig.|6). 
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Figure 9. Median (logioTL ya ) as a function of two different measures of 
the distance from galaxies. The blue points, which have been offset slightly 
for clarity, show the result as a function of transverse distance for LOS sepa- 
rations < 165 km s — 1 (i.e. 0.71 pMpc for pure Hubble flow), which is about 
the scale over which trends in the LOS direction are smoothed out (see Fig. [6} 
and over which errors are correlated (see Appendix|B}. For comparison, the 
grey points repeat the Hubble distance results shown in Fig. [7] The results 
for the true, 3-D distance are not measurable, but are expected to lie in be- 
tween the transverse and Hubble results shown here. Data from this figure is 
tabulated in Table|5] 

creases smoothly with 3-D Hubble distance. 

Because of redshift space distortions, it is not possible to 
measure the absorption as a function of the true, 3-D dis- 
tance. However, we expect the true result to be in between 
the transverse and Hubble results shown in Figure|9] The 3-D 
Hubble distance may overestimate the true distance on inter- 
mediate scales, because it assumes all velocity differences to 
be due to the Hubble flow, whereas in reality redshift errors 
and peculiar velocities will contribute. Note, however, that 
infall of the right magnitude could lead the 3-D Hubble dis- 
tances to underestimate the true distances. On the other hand, 
because we group LOS separations of — 165 kins" 1 , the 
transverse distance will typically underestimate the true dis- 
tance on intermediate scales because it implicitly assumes that 
the contribution of the Hubble flow is negligible up to velocity 
differences of 165 kms -1 . 

However, we chose to average over this velocity difference 
with good reason: within 165 kms -1 the signal is indepen- 
dent of velocity (see Fig. |6j and the errors are strongly corre- 
lated (AppendixlBj, which suggests that the velocities are not 
dominated by the Hubble flow. We therefore expect the trans- 
verse results to be closer to the truth, i.e. to better represent 
the real space absorption profile, and the excess absorption 
for 3-D Hubble distances ~ 200 pkpc to be due mostly to the 
inclusion of absorption around galaxies with smaller impact 
parameters. Indeed, we find that around these distances the 
scatter in the PODs is much greater if we bin in terms of 3-D 
Hubble distance than if we bin in terms of transverse distance 
(even though a velocity interval of 165 km s _1 corresponds to 
a relatively large Hubble distance of 710 pkpc). 

4.4.1. Scatter 

We have so far focused on the median absorption. How- 
ever, the dashed curves in Figure [7] demonstrate that there 
is a large degree of scatter in the distribution of PODs at a 



Table 3 

Data from Figure [9] 



Distance" [pMpc] Median loginTXya Median logioTL yo 



0.00-0.13 
0.13-0.18 
0.18-0.25 
0.25-0.36 
0.36-0.50 
0.50-0.71 
0.71-1.00 
1.00-1.42 
1.42-2.00 
2.00-2.83 
2.83-3.99 
3.99-5.64 



2.84 
0.19 
+0.27 
0.34 
+0.22 
0.43 
+0.15 
0.20 

uoy 0.18 
0.10 
0.0!) 
0.07 
0.07 
0.04 
0.04 
+0.03 
0.03 
+0.01 
0.03 

L ' M 0.03 
0.1)2 



1.16 
0.74 
0.07 
-0.37 



-0.75 
-0.82 
-0.91 
-1.05 
-1.14 



0.85 
-0.81 
-0.68_ 
-0.78] 
-0.87 
-0.79 
-0.89 
-0.93 
-0.98 



+0.30 

-0.37 
+0.22 
-0.18 
+ 0.33 



0.22 
0.22 
-0.16 
-0.10 
-0.11 
-0.00 
-0.08 
-0.07 
-0.05 
+0.07 
0.07 



-1.27" 



0.01 



a Distance bins in pMpc. 

k The median logioTL ya as a function of 3-D distance, with the \a confi- 
dence interval. 

c The median logioTL ya a function of transverse distance for LOS separa- 
tions < 165 kms -1 (i.e. 0.71 pMpc for pure Hubble flow), with the 1<t 
confidence interval. 

fixed distance. In this section we will therefore investigate 
how the median absorption around individual galaxies varies 
from galaxy to galaxjy. 

Figure [TU] shows histograms of the distribution of the me- 
dian optical depth in velocity intervals of ±165 kms -1 cen- 
tered on galaxies. The different panels correspond to differ- 
ent impact parameters. The x-axis has — oo as its lower limit. 
This indicates pixels set to r = 10~ 5 , because they had F > 1 
(see Section[3]for details). We expect the true optical depths 
of these pixels to be similar to the inverse of the S/N ratio, 
which is ~ 10~ 2 . The label "saturated" at the high-absorption 
end of the x-axis is for saturated pixels whose optical depth 
could not be recovered using higher order Lyman lines (be- 
cause they are either also saturated or unavailable). These 
pixels do not necessarily have optical depths that are higher 
than the highest recovered values (~ 10 2 ), because higher or- 
der lines can be either contaminated or unavailable. In that 
case we cannot constrain the flux to be smaller than the S/N 
ratio, which corresponds to optical depths of about 4-5. 

Comparing the different panels with each other, it seems 
that the distribution is shifted to higher optical depths for 
b < 0.13 pMpc, but that the results are similar for impact 
parameters 0.13 < b < 2.0 pMpc. Indeed, Kolmogorov- 
Smirnov (K-S) tests show that, except for the first panel, pan- 
els 2-8 are consistent with the last panel (at the ler level). For 
b < 0.13 pMpc, however, the distribution of median optical 
depths differs at the 99.98% confidence level (3.7c). These 
results are consistent with our findings for the medians of the 
distributions (Fig.|9j blue squares). 

Ignoring the outliers for which the median optical depth is 
saturated or — oo, the scatter is about 0.75 dex. The median 
absorption near galaxies with a fixed impact parameter is thus 
highly variable, suggesting that the gas is clumpy. This is 
true at all impact parameters and therefore not a distinguish- 
ing feature of the circumgalactic medium. 

The blue curve, which is repeated in every panel, indicates 

4 We could also have studied the distribution of individual PODs, but we 
prefer to consider only one data point per galaxy (i.e. the median POD within 
165 kms -1 ) because galaxies are independent, whereas pixels are not. 
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the distribution after randomizing the galaxy redshifts. It 
therefore gives the distribution that we would expect to mea- 
sure if the absorption were uncorrelated with galaxy positions. 
Using the K-S test to compare each histogram with the blue 
curve, we find that except for the second panel (which con- 
tains only 8 galaxies), all are discrepant at the 2a to 6a level. 
For b > 0.5 pMpc the excess absorption appears small, but 
the difference is in each case significant at the > 4cr level. For 
the second panel (b = 0.13 — 0.18 pMpc) the significance of 
the detection of excess absorption is < la. These results are 
in near perfect agreement with those based on median statis- 
tics. Thus, examination of the full distribution of median op- 
tical depths around galaxies confirms the result obtained for 
the medians. There is a sharp drop in the absorption around 
10 2 pkpc, which is similar to the virial radius, followed by a 
near constant, small, but highly significant, excess absorption 
out to transverse separations of at least 2 pMpc. 

4.5. Interpreting PODs 

The central optical depth is related to the absorbing gas col- 
umn density, N, through the following approximate relation: 
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where / is the oscillator strength, A is the transition's rest 
wavelength, b D = v^rms = (FWHM) /2 v / ln2 is the line 
width, and vrms is the line of sight velocity dispersion (e.g. 
Padmanabhan 2002, §9.5). In our study we use s tatistics 
based on either median optical depths or (in §14. 6. 31 on the 
maximum optical depth within a given distance range from a 
galaxy. Measurements based on the maximum optical depth 
in a given region are relatively easy to interpret using Equa- 
tion 0. However, given that most of our analysis is based 
on median optical depths, all the conversions that we make to 
column densities of the absorbing gas are likely to be under- 
estimates if most of the lines are not blended, and could be 
overestimates in regions where the lines are highly blended. 
For example, if there is a single line in a given velocity in- 
terval, the column density estimated from the median optical 
depth would be lower than the real column density of such a 
system. If, on the other hand, there is one strong line and a 
number of weaker lines that are blended with it, then the col- 
umn density inferred from the median optical depth could be 
higher than the median column density of individual absorp- 
tion systems. 

The left panel of FigureQT]shows the median neutral hydro- 
gen column density as a function of both transverse distance 
for LOS velocities 0— 165 km s _1 (solid curve) and 3-D Hub- 
ble distance (dashed curve), which we obtained from Figure|9] 
and equation (0, using the typical line width of 26kms -1 
measured for our sample by Rudie gtal] (120121) . The median 
column density decreases from ~ 10 145 cm~ 2 at ~ 10 2 pkpc 
to ~ 10 12 7 cm -2 at ~ 1 p Mpc, which i s in excellent agree- 
ment with results from Rudie et al. (2012) based on Voigt pro- 
file decompositions. Note that if we had selected the strongest 
system within a given distance of each galaxy, instead of tak- 
ing into account all pixels within that distance, the value of 
the median column density (i.e., the median of the maximum 
column density) would have been significantly higher. 



To gain intuition about what the observed absorption repre- 
sents, we will convert the optical depths into overdensities us- 
ing two ap proximations. C ombining Equation (0 with Equa- 
tion (10) of Schave (2001), who treats Lya absorbers as grav- 
itationally confined gas clouds with sizes of order the local 
Jeans length, we obtain: 



A fa 9 It 2 / 3 V 2 ^ 3 
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(8) 



where A = p/(p) is the density of gas in units of the mean 
baryon density of the Universe, T12 is the photo-ionization 
rate in units of 10 12 s _1 , and / g denotes the fraction of the 
cloud mass in gas. We have assumed a temperature typical 
for the moderately overdense IGM ( e.g. iSchave et al.l l2000a: 
iLidz et al]l2010t iBecker et alj|201 ll) . a line width consis tent 
with the median value measured by iRudie et ail (120121) for 
our data, and a photo-ionization rate appropriate for ioniza- 
tion by the ultraviolet background r adiation (e.g. Bolt on et alj 
I2005t iFaucher-Giguere et alj|2008l) . In collapsed gas clouds 
/ g could be close to unity, but far away from galaxies gas 
will not be in dense clumps and / g should be close to its uni- 
versal value of fib/fim. Note that these densities are effec- 
tively evaluated on the local Jeans scale, w hich is typica lly 
~ 10 2 pkpc for the densities of interest here (Schave 2001). 

The above expression is a good approximation for over- 
dense absorbers. Absorbers with densities around or below 
the cosmic mean have n ot had suffici ent time to reach local 
hydrostatic equilibrium (Schave 2001) and will be better de- 
scribed by the fl uctuating Gunn-Peterson approximation (e.g. 
iRauch et alii 19971) . which assumes smoothly varying density 
fluctuations and pure Hubble flow, yielding 
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3.36 



-9/4 
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Both equations (0 and (0 assume primordial abundances, 
highly ionized gas, and photo-ionization equilibrium. How- 
ever, close to galaxies, UV radiation from local sources may 
dominate over the background and the gas may be shock- 
heated to temperatures sufficiently high for collisional ion- 
ization to dominate. Both of these effects would cause us to 
underestimate the gas density, possibly by a large factor. 

The middle and right panels of Figure QT| show the me- 
dian overdensity profiles obtained after applying the above 
equations to the median optical depth measured as a func- 
tion of 3-D Hubble and transverse distance (for LOS separa- 
tions — 165 kms -1 ), respectively. The "Jeans" and fluc- 
tuating Gunn-Peterson approximation are generally in good 
agreement, although the former yields steeper density profiles 
at small distances, where the gas is highly overdense. How- 
ever, in this regime we do not expect the fluctuating Gunn- 
Peterson approximation to hold. The dashed curve shows the 
result if we convert the recovered optical depths into over- 
densities using the fi t to hydrodynamical simulations given in 
Aguir re et alj (120021) for z — 2.5. The relation provided by 
those authors was obtained by producing mock Lya absorp- 
tion spectra for sight lines through a hydrodynamical simu- 
lation, scaling the simulated spectra to fit the observed mean 
flux decrement, and then fitting the relation between the Lya 
optical depth-weighted overdensity of gas responsible for the 
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Figure 10. The histograms show the distribution of the median HI Lye* optical depth within 165 km s — 1 from galaxies. Different panels correspond to different 
impact parameters in the range 0-2 pMpc, as indicated above each panel. The blue curve, which is repeated in each panel, shows the distribution after randomizing 
the galaxy redshifts, i.e., the distribution expected in the absence of absorption correlated with galaxy positions. There is strong (> 2 — 6c) evidence for excess 
absorption at all impact parameters, but there is no evidence for a trend with impact parameter beyond 0.13 pMpc. 




Figure 11. Inferred median neutral hydrogen column density (left panel) and inferred median overdensity (middle and right panels) as a function of distance 
from galaxies. Column densities were obtained from the measured, median Lya optical depths and equation Q. Overdensities were obtained by converting the 
median optical depths using the Schave ( 2001) " Jeans" approxim ation (solid curves), the fluctuating Gunn-Peterson approximation (dotted lines), or the fit to 
the results of the hydrodynamic simulation of Aeuirre et al. 12002) (dashed curves). Overdensities inferred from observed optical depths are effectively volume 
weighted (and are lower than mass-weighted overdensities). The right panel and the solid curve in the left panel show results as a function of transverse distance 
for velocity separations < 165 kms -1 , while the middle panel and the dashed curve in the left panel are for 3-D Hubble distances. Typical column densities 
(overdensities) decrease from ~ 10 cm - 2 (~ 10) at ~ 10" 1 pMpc to < 10 13 cm" 2 (< 1) at > 1 pMpc. 
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absorption in each pixel and the recovered Lya POD in that 
pixel. The agreement between the hydrodynamical simulation 
and the Jeans method is clearly excellent. 

We conclude that typical gas overdensities decrease from 
> 10 at distances < 10 2 pkpc, which is similar to the virial 
radii of the halos hosting our galaxies, to ~ 1 at ~ 1 pMpc. 
The steepness of the drop at intermediate scales is uncertain 
due to redshift space distortions, but it is likely to be bracketed 
by the the middle and right panels of Figure [TT] Observe that 
on large scales the overdensity asymptotes to values smaller 
than unity. This is expected, as the median optical depth is a 
volume weighted measure rather than a mass-weighted mea- 
sure. While the mass-weighted mean overdensity is by defini- 
tion unity, the volume-weighted mean overdensity is smaller 
because underdense regions dominate the volume. As most 
of the pathlength in the Lya forest is across voids, the median 
density inferred from the Lya forest is less than 1 . The dif- 
ference between volume and mass-weighted quantities proba- 
bly also accounts for the relatively low overdensity measured 
around the virial radius (~ 10 _1 pMpc). 

4.6. Circumgalactic matter 

The smallest impact parameter in our sample is ss 55 
pkpc. It is difficult to probe scales smaller than this with 
QSO-galaxy pairs owing to the small number of bright QSOs 
and the rarity of close pairs, as well as to the difficulty of 
observing objects right in front of a bright QSO. For such 
small scales it is therefore more effi cient to resort to using 
galaxies as background objects (e.g. j Adelberger et alj 120051 : 
iRubin et all 120101: ISteidel et al.ll2010l) . Nevertheless, in this 
section we will use our sample of background QSOs to study 
the gas within transverse distances of 200 pkpc of galaxies. 
But before doing so, we will compare o ur results with the 
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small-scale data from Stei del et a, 

4.6. 1 . Comparison with results for galaxy-galaxy pairs 

ISteidel et alJ (1201 Oh measured Lya rest-frame equivalent 
width (EW) as a function of distance from the same sample 
of galaxies that we study here. Since they used background 
galaxy spectra for probing foreground galaxies' circumgalac- 
tic matter, they were able to study smaller scales than we can 
probe with the QSO spectra. While they could measure the 
EW in a stack of background galaxy spectra, the spectral res- 
olution of the galaxy spectra was too low to measure column 
densities or resolved pixel optical depths and the S/N was too 
low to obtain EW measurements for individual galaxies. The 
red circles in Figure [12] show their measurements at small im- 
pact parameters and the blue circles show their measurements 
using QSO spectra, together with our EW measurements from 
QSO spectra at larger impact parameters (black squares). Al- 
though we used the same d ata, their technique for measuring 
EW differs from ours (see ISteidel etaill2010l for more de- 
tails). 

We computed the EWs as follows. We shifted the spec- 
tra into the rest-frame of each galaxy within a given impact 
parameter bin, found the mean flux profile, divided it by the 
mean flux level of all pixels in the spectra (i.e. 0.804) in order 
to mimic the effect of continuum fitting low-quality spectra, 
and integrated the flux decrement over ±500 km s _1 centered 
on the galaxies' positions. We verified that using a larger ve- 
locity interval (< 1000 km s _1 ) gives consistent results. 

It appears that the absorption strength falls off according to 
a power law, out to ~ 800 pkpc. Beyond this impact parame- 
ter the relation flattens off at EW— 0.2 - 0.3 A. 




0.01 0.10 1.00 

Impact parameter [Mpc] 

Figure 12. Rest-frame equivalent width of Lya absorption as a function 
of imp act parameter, for the galaxy-galaxy pair samples from Ste idel et alj 
12010) shown as red circles, and as measured from the data set used in this 
paper, QSO-galaxy pairs, as grey squ ares. The blue circles show QSO- 
galaxy pair results from Steidel et al. 1 2010), using the same KBSS data, 
but a different way of measuring EW and different bin sizes.) The verti- 
cal eiTor bars are estimated by bootstrap resampling the galaxy sample. The 
horizontal error bars indicate bin sizes. The grey line shows the best fit: 
EW= O.f 1&~ 0,76 + 0.25, where b is impact parameter of galaxies. 

4.6.2. Scatter 



Adelberger et al. (2005) found that the absorption near 
galaxies follows a bimodal distribution. While the flux decre- 
ment is usually large, w 1/3 of galaxies are not associated 
with strong Lya absorption within a 3-D Hubble distance of 
1 hr 1 pMpc (i.e. w 400 pkpc). 

As shown in Figure [10] we also find that the absorption 
varies strongly from galaxy to galaxy. However, we showed 
that this is not only true near galaxies, but also if we look in 
random places of the spectrum. Moreover, the distribution of 
medi an optical depths i s not b imodal. 

As|Adelberger et al] (120051) did not measure optical depths, 
we show the distribution of the median flux decrement for 
impact parameters < 200 pkpc and velocity differences < 
165 kms -1 from galaxies in the left panel of Figure Qj] For 
comparison, the right panel shows the corresponding median 
optical depths. As before, we chose this velocity interval be- 
cause there is little structure in the median profiles for velocity 
separations smaller than this value (see Figs. [5] and [5), which 
is not surprising given that 165 kms -1 is similar to both the 
redshift errors and the circular velocities of the halos thought 
to host the galaxies. The blue curves show the results for ran- 
domized galaxy redshifts, where we assign random redshifts 
(within the Lya forest redshift range) to the galaxies in our 
sample. The curves are the result of estimating a histogram 
for randomized redshifts 1,000 times, and taking the mean 
median in each bin. A K-S test shows that the two distribu- 
tions are discrepant at the > 6er level, so the absorption is 
clearly enhanced near galaxies. 

It can be seen in the left panel that galaxies either show 
very little transmission, or relatively high flux. The flux dis- 
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Figure 13. Number of galaxies vs. median flux (left panel) and median logioTL ycl (right panel) within 165 kms' 1 from galaxies with impact parameters 
b < 200 pkpc from galaxies. The blue curves show the results expected if absorbers and galaxy positions were uncorrected, and were obtained after randomising 
the galaxy redshifts 1,000 times. In the right panel — oo indicates the pixels with F > 1, and "saturated" indicates saturated pixels for which the optical depth 
could not be recovered (see text for more details). The observed absorption is enhanced at the > 6cr level. 



tributi on is thus bimodal, in agreement with Adelberger et al. 
(2005). However, the right panel shows that there is no evi- 
dence for bimodality of the optical depth distribution. Hence, 
the bimodality seen in the left panel is a consequence of the 
mapping F = exp(— r) which bunches the low (high) optical 
depth tail of the distribution together at a flux decrement of 
zero (one). Note that the shape of the optical depth distribu- 
tion (which appears to be approximately lognormal) is phys- 
ically more relevant than that of the flux, because the former 
is proportional to the neutral hydrogen column density. 

We will demonstrate next that the scatter is not random as 
the optical depth is strongly anti-correlated with the impact 
parameter. 

4.6.3. Coldflows 

In recent years cosmological simulations and theoretical 
models have converged on the idea that gas accretion is bi- 
modal. While the gas accreting onto massive halos is shock- 
heated to the virial temperature, in lower mass halos most 
of the gas falls in cold (T ~ 10 4 K). The cold mode feeds 
the galaxy through filaments (cold streams) whereas the hot 
mode results in the formation of a hydrostatic halo which 
fuels the galaxy through a cooling flow (e.g. iKeres et al] 
20051 lOcvrrk et al Il2008t iDekel et al.ll2009HCrain et al.ll2010t 
van de Voort et alj 1201 lalfbt IFaucher-Giguere et alj 1201 lbl) . 
At z = 2 halo masses of ~ 10 12 M , i.e. similar to those 
hosting our galaxies (Trainor et al. 2011, in preparation), 
are especially interesting in that they mark the transition be- 
tween the cold- and hot-mode a ccretion regimes. For ex- 
ample, Ivan de Voort et al.l d201 lal) predict that the hot mode 
contributes on average « 50% and s» 30% to the growth of 
10 12 M halos and their central galaxies, respectively. The 
simulations predict that individual galaxies in such halos are 
fed simultaneously through both modes with cold streams 
penetrating hot, hydrostatic halos. 

Despite the theoretical consensus, there is no direct obser- 
vational evidence for this galaxy formation picture as of yet. 
Absorption by neutral hydrogen is a promising way to ob- 
serve the cold streams. Based on a high-resolution simula- 
tion of a single halo with mass 3 x 10 11 M Q at z = 2, 



IFaucher-Giguere & Keresl (1201 lal) predict that the covering 
fraction of cold flows with Lyman Limit (LLS; 10 17 2 < 
Nhx < 2 x 10 20 cm~ 2 ) and DLA column densities (JVhi > 
2 x 10 20 cm -2 ), within the virial radius (2 virial radii) is 10— 
15% (4%) and 3^4% (1-2%), r espectively, where t hey quote 
74 pkpc for the virial radius. iKimm etafl (1201 ll) use cos- 
mological simulations to predict that the covering fraction of 
systems with Nm > 10 20 cm~ 2 withi n 100 pkpc of halos 
of - 10 12 M Q at z = 2.5 is w 5%. [Stewart et al] (120111) 
simulate two halos with masses of ~ 10 115 M Q at z = 2 
and predict coveri ng fractions for Nm > 10 16 cm -2 of about 
15% at 33 pkpc. Fumag alli et al.l (1201 ll) study the covering 
fraction of cold streams around 7 haloes. For two haloes with 
virial masses of 5.9 x 10 n M o and 9.2 x lO n M atz = 2.33, 
which are typical for star-forming galaxies from our sample, 
they predict a covering fraction for DLAs (LLSs) within 2R V i r 
of 2.22 (6.89) and 1.54 (9.11) per cent, respectively. 

The left panel of FigureQ~4]shows the median optical depth 
within ±165 kms -1 from galaxies, as a function of the im- 
pact parameter. The results are insensitive to the exact ve- 
locity interval chosen, although a galaxy with b = 155 pkpc 
would have been a DLA system if we had used a maximum 
velocity separation > 200 kins" 1 . Each circle represents a 
galaxy. Sub-DLA systems are shown as star symbols. In two 
non-DLA cases the median optical depth could not be mea- 
sured because more than half of the pixels were saturated and 
there were insufficient higher order lines to recover the opti- 
cal depth. For those cases we plot the largest recovered opti- 
cal depth and show it as an upwards-pointing arrow to indicate 
that it is a lower limit. Note, however, that there is no evidence 
for DLA absorption associated with any of the galaxies plot- 
ted as lower limits. The vertical dashed line shows the virial 
radius for halos with mass M = 10 9 M (Trainor et al. 
2012). To compare with predictions from the literature, it is 
more appropriate to look at the maximum pixel optical depth 
in the ±165 kms" 1 interval around each galaxy, because we 
can convert this into a neutral hydrogen column density using 
equation (0. This results in significantly more lower limits, 
as can be seen in the middle panel of Figure [14] Finally, the 
right panel shows the minimum flux. 
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Figure 14. The left and middle panels show, respectively, the median and the maximum optical depth within 165 kms 1 from each galaxy that is separated 
from the LOS by less than 200 pkpc as a function of its impact parameter. In cases where the median/maximum corresponded with a saturated pixel without 
an optical depth estimate we plot the largest recovered value of the POD within 165 kms -1 as an upward pointing arrow - these are lower limits to the actual 
optical depths. The dotted horizontal line indicates the approximate threshold for Lyman limit absorbers. Stars indicate a sub-DLA, identified via its damping 
wings (and with line centers < 165 kms from the nearest galaxy). The right panel shows the minimum flux within 165 kms -1 from each galaxy. The 
vertical dashed lines indicate the virial radius for typical halos hosting the galaxies in our sample, with M h = 10 119 Mq at z = 2.4. 



Figure [14] shows a clear trend of increasing absorption 
strength with decreasing distance. While the scatter in the 
optical depth is large at all impact parameters, in terms of flux 
there is actually little variation within 100 pkpc. In particu- 
lar, it is striking that all 10 galaxies with impact parameters 
smaller than 100 pkpc are associated with saturated absorbers 
(FwO). 

Within 100 pkpc 10 out of 10 galaxies have median tl vq > 
1, which corresponds to a covering fraction of 100^ 2 %. 
Within 200 pkpc this decreases to 66 ± 16%. If we use the 
maximum optical depth then the covering fraction of TL ya > 
1 absorbers within 200 pkpc increases to 89j^g%. Note that 
the central optical depth is unity for lines with column density 
TVhi ~ 4 x 10 13 cm -2 (see equation|7). 

It appears that 1-5 out of 6 galaxies with impact parameters 
smaller than the virial radius of their host halos are associated 
with LLSs (the horizontal dashed line shows the central opti- 
cal depth for a TVjji = 10 17 " 2 cm~ 2 absorber assuming a line 
width of 26 km s _1 ), i.e. there is one secure LLS and 4 possi- 
ble LLSs given that we have only lower limits on the optical 
depth of these absorbers. In addition, 1 out of 6 galaxies has a 
sub-DLA within i? v ir- Within two virial radii 1-15 out of 21 
galaxies are associated with LL systems and 1 out of 21 galax- 
ies is associated with a sub-DLA (in addition there would have 
been 1 DLA if we had used a maximum velocity separation 
> 200 k ms" 1 ). These numbers are higher tha n the predic- 
tions f rom lFaucher-Giguere"&^ Keres (201 la) and Kim m et ail 
(120111) . and also higher than iFumagalli et a!T(l201 ll) forDLAs 
(they are consistent with their results for LLSs), but the errors 
are too large for the discrepancy to be significant. We note that 
when it comes to measuring the column densities of strongly 
saturated systems (e.g. Figure [141 where our method yields a 
number of lower limits), it would be more appro priate to use 
Voigt p rofile decompositions, as are presented by Rud ie et al.1 
(2012). This other method gives results fully consistent with 
those presented here. 

Even though these observations are consistent with what is 
seen in simulations, it is unclear w hether the observed absorb- 
ing gas is inflowing or outflowing. Steid el et al.l d2010) found, 
using 500 galaxy pairs sampling angular scales of 1-15", that 
the circumgalactic medium within s» 80 — 90 pkpc shows 
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Figure 15. The median 3-D Hubble distance to the nearest galaxy in our 
sample, normalized by the median 3-D Hubble distance to the nearest galaxy 
for all pixels, as a function of the pixel's Lya optical depth. We normalize 
by this distance in order to get results that do not depend on the galaxy sam- 
ple's completeness. The unnormalized distance scale is shown on the right 
y-axis. The grey shaded area shows the lcr confidence interval around the 
results obtained when using randomized galaxy redshifts. Absorbers with 
logio(TL va ) > —1.0 tend to be closer to galaxies than a random place in 
the Universe is, while lower optical depth pixels tend to be further away. 

strong H I and low-ion metallic absorption, which is consis- 
tent with the findings in this paper. However, they also find 
that galaxy spectra exhibit kinematics consistent with radial 
flows with velocity increasing outward. 

5. THE DISTRIBUTION OF GALAXIES AROUND ABSORBERS 

In the previous sections we investigated the typical IGM 
environment of the galaxies in our sample. Here we address 
the complementary question: what is the galaxy environment 
for a pixel of a given optical depth? 

Figure [15] shows the median 3-D Hubble distance to the 
nearest galaxy in our sample as a function of a pixel's Lya 
optical depth, normalized by the median 3-D Hubble distance 
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to the nearest galaxy for all pixels (2.88 pMpc), irrespective 
of their optical depth (the unnormalized distance is shown on 
the right r/-axis). Note that because we divided by the me- 
dian distance for all pixels, the result is not directly dependent 
on the completeness of our sample (the actual, unnormalized 
distances are, however, highly sensitive to the completeness). 
The error bars were determined by bootstrap resampling the 
QSO spectra using chunks of 500 pixels. The grey shaded 
area shows the la confidence interval of results obtained for 
randomized galaxy samples. It shows what would be expected 
if absorbers and galaxies were distributed randomly with re- 
spect to each other. 

We cut the plot at the low optical depth end at the value 
corresponding to a 2cr detection of absorption for a S/N of 50. 
Noise prevents us from distinguishing lower optical depths 
from each other, causing the trend to flatten. At the high 
optical depth end we cut the plot at TL yQ = 10. Although 
we can recover higher optical depths, we find that pixels with 
Thy a > 10 are biased: while the median redshift stays close 
to 2.36 for optical depths in the plotted range, it increases 
to about 2.5 for higher optical depths. Because of our se- 
lection function, the galaxy number density is lower for such 
high redshifts, causing an upturn of the distance to the nearest 
galaxy for t > 10. 

The optical depth is strongly anti-correlated with the dis- 
tance to the nearest galaxy. Pixels with r > 0.1, are closer 
to galaxies than a random place in the Universe is, while pix- 
els with r < 0.1 are on average farther from galaxies than a 
random location is. 

To provide some physical interpretation of this obser ved 
trend we use the Jeans approximation described in Section l431 
to convert optical depths into estimates for the gas overdensi- 
ties (implicitly smoothed on the scale of the absorbers, i.e. 
~ 10 2 pkpc) (top x-axis). Gas at the mean density of the 
Universe would produce logioTL yQ ~ —0.65 and is thus typ- 
ically closer to galaxies than a random place in the Universe. 
Although it is unclear whether we can estimate the overden- 
sity sufficiently accurately for this statement to be reliable, it 
would in fact not be a surprising result. Because voids take 
up most of the volume, a random place in the universe will be 
underdense. 

If we cube the normalized distance to the nearest galaxy, 
then we get something close to the inverse of the overden- 
sity of galaxies. However, in that case the scale over which 
this number density is measured, would be the distance to the 
nearest galaxy and would thus vary with the density itself. It 
is more instructive to measure the overdensity of galaxies on 
a fixed length scale, which is something we can also measure 
from our data. 

In Figure [16] we show the mean overdensity of galaxies in 
our sample as a function of pixel optical depth, evaluated on 
(3-D Hubble) scales of 0.25, 0.5, 1, 2, 4, and 8 pMpc around 
them (tabulated in Table [4]). Note that these galaxy number 
densities cannot be directly compared with the gas overdensi- 
ties that we can estimate from the optical depths because they 
are evaluated over different length scales. 

The galaxy number density increases with increasing opti- 
cal depth, and the density contrast is stronger when it is mea- 
sured on smaller scales. For example, pixels with tl vq ~ 10 
(at z Ri 2.4) see on average a galaxy overdensity w 2 within 8 
pMpc, but an overdensity ~ 10 within 0.25 pMpc. Pixels with 
r Lya ~ 1 typically reside in regions where the galaxy number 
density is close to the cosmic mean on scales > 0.25 pMpc. 
Because galaxies are clustered, this implies that for such pix- 
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Figure 16. The mean overdensity of galaxies in our sample within 0.25 
(green diamonds), 0.5 (pink squares), 1 (orange triangles), 2 (red diamonds), 
4 (blue asterisks), and 8 (black crosses) 3-D Hubble pMpc from pixels of a 
given Lya optical depth. The mean galaxy density is higher near stronger 
absorbers, and the density contrast is more pronounced when estimated on 
smaller scales. The data from this figure is tabulated in Tablef4] 

els the distance to the nearest galaxy will typically be smaller 
than it is for a random place in the Universe (see Fig . |T5j) . 

Given that we have both measurements of the galaxy num- 
ber density and estimates of the gas overdensities, we could 
attempt to estimate the bias of these two components rela- 
tive to each other by computing the ratio of the two den- 
sities. However, the length scale over which gas densities 
corresponding to Lya absorbers are implicitly smoothed (i.e . 
the local Jeans scale in case of overdense gas: ISchaya :2001). 
varies with the density and hence the optical depth, while 
galaxy and gas density have to be evaluated on the same scale 
in order to calculate the relative bias. We could in princi- 
ple estimate the galaxy overdensity on scales that vary with 
the optical depth, but for overdense gas the relevant scales 
are somewhat too small to get a robust estimate of the galaxy 
number density. 

6. SUMMARY & CONCLUSIONS 

In this paper we studied the galaxy-IGM interface at z rj 
2 — 2.8 in the 15 fields of the Keck Baryonic Structure Sur- 
vey. Each KBSS field is centered on a very bright background 
QSO of which high S/N, high resolution spectra have been 
obtained, and includes a densely-sampled galaxy survey op- 
timized to sample the same range of redshifts probed by the 
IGM measurements. Our analysis has included a total of 679 
galaxies, the subset of the larger KBSS sample that lies in 
the appropriate range of redshifts within 2pMpc of the cor- 
responding QSO sightline. The redshifts of 71 of our galax- 
ies were measured directly from nebular lines observed in the 
near-IR, while the remaining redshifts were derived from ISM 
absorp tion and/or Lya em ission lines and the redshift calibra- 
tion of Ra kic et all (1201 ll) . The median redshift of our galax- 
ies is 2.36. 

We analyzed the spectra using the pixel optical depth 
method, which uses an automatic algorithm to recover the 
Lya optical depth in each pixel, making use of higher order 
Lyman lines to recover the optical depth in pixels with satu- 
rated Lya absorption. Our analysis relied almost entirely on 
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Table 4 

Data from Figure [16] 



loglOTLyc/ A gi 8pMpc b A gi 4 p Mpc° A g> 2pMpc d A gi i p Mpc C A gj o.5pMpc f A g ,o.25pMpc 



-1.39 - -1.21 
-1.21 --1.02 
-1.02- -0.84 
-0.84 - -0.65 
-0.65 - -0.47 
-0.47 - -0.29 
-0.29- -0.10 
-0.10-0.08 
0.08 - 0.26 
0.26 - 0.45 
0.45 - 0.63 
0.63-0.82 
0.82-1.00 



0.95±0.02 
0.98±0.02 
1.01±0.02 
1.01±0.02 
1.02±0.02 
1.07±0.03 
1.08±0.03 
1.07±0.03 
1.12±0.04 
1.19±0.06 
1.22±0.05 
1.35±0.07 
1.30±0.07 



0.91±0.02 
0.96±0.02 
1.01±0.02 
l.O3±0.03 
1.05±0.03 
1.13±0.03 
1.16±0.04 
1.17±0.04 
1.25±0.05 
1.34±0.07 
1.38±0.07 
1.50±0.10 
1.51±0.10 



0.84±0.03 
0.91±0.03 
0.95±0.03 
0.98±0.04 
1.07±0.03 
1.20±0.04 
1.28±0.05 
1.40±0.06 
1.49±0.07 
1.66±0.10 
1.72±0.10 
1.95±0.16 
1.99±0.17 



0.66±0.04 
0.76±0.05 
0.88±0.05 
0.91±0.05 
1.04±0.06 
1.25±0.07 
1.41±0.10 
1.52±0.10 
1.78±0.14 
2.19±0.19 
2.18±0.22 
2.63±0.31 
3.14±0.38 



0.60±0.08 
0.65±0.08 
0.75±0.10 
0.77±0.12 
0.86±0.12 
1.21±0.15 
1.35±0.22 
1.68±0.21 
1.89±0.28 
2.37±0.39 
2.12±0.44 
3.06±0.66 
4.09±1.02 



0.28±0.11 
0.32±0.11 
0.31±0.13 
0.47±0.23 
0.71±0.31 
0.81±0.31 
0.92±0.37 
1.01±0.37 
1.74±0.60 
2.83±0.84 
4.46±1.63 
6.77±2.07 
7.97±2.73 



a Bins in logioT LyQ . 

k The mean overdensity of galaxies in our sample within 8 3-D Hubble pMpc, with the 1 a confidence interval. 

c The mean overdensity of galaxies within 4 3-D Hubble pMpc. 

^ The mean overdensity of galaxies within 2 3-D Hubble pMpc. 

e The mean overdensity of galaxies within 1 3-D Hubble pMpc. 

^ The mean overdensity of galaxies within 0.5 3-D Hubble pMpc. 

g The mean overdensity of galaxies within 0.25 3-D Hubble pMpc. 



median statistics. Because the Lya optical depth is propor- 
tional to the column density of neutral hydrogen, its distribu- 
tion is physically more relevant than that of the flux. 

Our results can be summarized as follows (as before, all 
distances are proper): 

• We presented the first 2-D map of absorption around 
galaxies, showing the median Lya optical depth as a 
function of transverse and LOS separations. By con- 
verting velocity differences along the LOS into dis- 
tances assuming pure Hubble flow, redshift space dis- 
tortions show up as anisotropies in the absorption maps. 

• Hydrogen Lya absorption is strongly enhanced near 
galaxies. Within ~ 10 2 pkpc the median optical depth 
is > 2 orders of magnitude higher than in a random 
place (TL yQ > 10 versus < 10 _1 ). 

• Two types of anisotropies are clearly detected: 

1. Onscales- 30-200 kms" 1 , or0.1-l pMpc, the 
median absorption is significantly stronger, by ~ 
1 — 2 orders of magnitude, along the LOS than in 
the transverse direction. We showed that although 
this "finger of God" effect may be explained by 
galaxy redshift errors, it is probably dominated by 
peculiar velocities of gas within and around the 
halos. 

2. On scales of ~ 1.4 — 2.0 pMpc the absorption is 
significantly stronger (with > 3cr significance) in 
the transverse direction than along the LOS. We 
interpret this compression in the LOS direction as 
large-scale infall towards the potential wells oc- 
cupied by the galaxies, i.e. the iKaiseri (119871) ef- 
fect. To our knowledge, this is the first time that 
this signature of gravitational collapse has been 
detected for gas. 

• The median optical depth as a function of impact pa- 
rameter (over ±165 km s -1 intervals around galaxies) 
drops by more than an order of magnitude going from 



b < 0.13 pMpc to 0.13 < b < 0.18 pMpc and re- 
mains approximately constant thereafter out to at least 
2 pMpc, where it is still a factor of two higher than in 
a random part of the spectrum. The sharp transition oc- 
curs at a radius that is similar to the virial radius of the 
halos that are thought to host the galaxies. 

• The median optical depth as a function of 3-D Hubble 
distance (i.e. the 3-D distance computed under the as- 
sumption of pure Hubble flow) shows no steep drop at 
10 2 pkpc. Instead, it decreases smoothly away from 
galaxies. The lack of a steep drop reflects the fact that 
3-D Hubble distances of a few times 10 2 pkpc mix 
strong absorption arising at impact parameters < 10 2 
pkpc with the mostly weaker absorption at larger trans- 
verse separations. 

• Excess absorption is detected with > 3er significance 
out to 2.0 pMpc in the transverse direction. In terms of 
the 3-D Hubble distance the significance is > 3er out to 
2.8 pMpc and 1.5a for 2.8 to 4 pMpc. 

• The median optical depth near galaxies with a fixed im- 
pact parameter is highly variable, suggesting that the 
gas is clumpy. The scatter in the median optical depth 
at fixed impact parameter is ~ 0.75 dex. Beyond 100 
pkpc this is large compared with the excess median ab- 
sorption. 

• Converting optical depths to column densities, we es- 
timate that the median neutral hydrogen column den- 
sity decreases from > 10 5 cm~ 2 within 10 2 pkpc to 
< 10 13 cnr 2 for larger impact parameters. We note 
that if we had selected the strongest absorber within 
10 2 pkpc of each galaxy, instead of taking all the pixels 
within that distance into account, the median column 
density of so selected systems would have been sub- 
stantially higher. 

• Converting optical depths into overdensities (implic- 
itly smoothed on the scale of the absorbers, i.e. ~ 
10 2 pkpc) by applying the "Jeans" approximation 
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dSchavdl2001[). the fluctuating Gunn-Peterson approx- 
imation (e.g. lRauch et alJI 19971). or a fit to a hydrody- 
namical simulations dAguirre et al.ll2002l) all yield sim- 
ilar results, with the Jeans approximation and the hydro 
simulation agreeing particularly well. We estimate that 
median gas overdensities decrease from > 10 within 
10 2 pkpc to ~ 1 at 1 pMpc. Beyond this impact param- 
eter the overdensity asymptotes to values smaller than 
unity, which is expected because underdense regions 
dominate the volume and gas densities inferred from 
optical depths are volume rather than mass weighted. 

• The covering fraction of absorbers with median optical 
depth r LyQ > 1 within 165 kms -1 is 100+° 2 % for 
b < 100 pkpc and 66 ± 16% for b < 200 pkpc. If we 
use the maximum rather than the median optical depth 
than the latter fraction increases to 89l^g%. 

• By turning to a pixel-centric view, i.e. estimating the 
median distance to the nearest galaxy as a function of 
the pixel's optical depth, we show that absorbers with 
T Lya > 0.1 are typically closer to galaxies than a ran- 
dom location in the Universe, while weaker absorbers 
are farther from galaxies compared to random. 

• The average number density of galaxies around ab- 
sorbers increases with increasing absorption strength, 
and the galaxy density contrast is higher when it is eval- 
uated on smaller scales. For example, on scales of 0.25 
pMpc the galaxy overdensity increases from ~ 10 -0 5 
at TL yQ = 10 _1 to ~ 1 at tl vq = 1 and to ~ 10 at 
T Lya = 10. When evaluated in spheres with radius of 
2 pMpc the galaxy overdensity increases from w 0.9 at 
T Lya ~ 10 _1 to w 2 for TL yQ ~ 10. Absorbers with 
T Lya ~ 1 reside in regions where the galaxy number 
density is close to the cosmic mean on scales > 0.25 
pMpc. As most of the volume is underdense, such ab- 



sorbers are nearer to galaxies than a random place in 
the Universe is. 

This work could be improved considerably by reducing the 
redshift errors, which will be possible through observations 
of nebular emission lines with t he new multi-object near-IR 
spectrograph Keck I/MOSFIRE dMcLean et alJl2010l) . Such 
observations would also allow us to investigate whether the 
absorption varies with the properties of the galaxies such as 
their stellar masses or star formation rates. 

In a future paper we will present a comparison of our ob- 
servational findings with cosmological, hydrodynamical sim- 
ulations, which will aid the interpretation of the absorption 
profile s that we see around galaxies. In addition, Rudie et al. 
d2012l) present an analysis of the same data based on Voigt 
profile decompositions. 
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A. THE EFFECT OF REDSHIFT ERRORS 

As discussed in Section l2.1.1l there are random errors in the redshift measurements of « 130 km s _1 for those obtained from 
LRIS spectra, and sa 60 km s~ x for the NIRSPEC subsample. We expect that adding errors that are small compared to the redshift 
errors that are already present in the data will have very little effect, whereas adding errors that are large compared to the actual 
redshift errors should strongly degrade the signal. 

Figure [17] illustrates what effect the addition of random, Gaussian redshift errors has. The left and right panels show median 
optical depth profiles in cuts along the x- and y-axes of Figure [5] corresponding to the nearest LOS and transverse bins, respec- 
tively. The data points in the left (right) panel are thus identical to the red circles (black squares) of the first panel of Figure [6] 
The blue, green, red, and orange curves show the results after adding redshift errors with a = 55, 110, 165, and 495 kins -1 , 
respectively. This is done by adding a random error from a given Gaussian distribution to each galaxy redshift. Redshift errors 
smear out the signal along the LOS direction, thereby strongly reducing its amplitude. Adding 55 kms -1 errors hardly changes 
anything, but for 110 and 165 km s _1 significant differences start to appear. Errors of 495 km s _1 completely wash out the signal. 
We conclude that the true redshift errors are < 170 km s _1 . 

B. CORRELATIONS BETWEEN DATA POINTS 

Because a single galaxy contributes pixels to multiple distance bins along the LOS, we expect the errors to be correlated in the 
LOS direction. Conversely, since a single galaxy only has one impact parameter, we expect the errors to be uncorrelated in the 
direction transverse to the LOS. In this section we will show that this is indeed the case and that the errors along the LOS are 
correlated over scales of ~ 10 2 kms -1 . 

To study the correlation between points, we made 1,000 bootstrap realizations of our galaxy sample, found the median TL yQ 
in each of the distance bins for each of the realizations, and then computed the Spearman rank correlation coefficient between 
different distance bins. In other words, if we created a scatter plot of the median optical depth in bins a and b where each point 
is a bootstrap realization, then a high rank correlation would show up as points clustered along a line, whereas a correlation 
coefficient of zero would correspond to a random cloud of points in r a — Tf, space. 
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Figure 17. The effect of redshift errors. The data points with error bars in left (right) panels shows the median logioTLya as a function of transverse (LOS) 
separation for the velocity (impact parameter) range indicated in the panels. These correspond to the first panel of Fig. [6] The blue, red, and orange curves show the 
result after adding random, Gaussian redshift errors with a = 55, 110, 165, and 495 km s -1 , respectively. Significant differences start to appear for 165 kms , 
while three times greater redshift errors nearly completely wash out the signal. This suggests that the actual errors on the redshifts are < 170 kms -1 . 
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Figure 18. Correlations between errors of different data points. The data points with error bars in left (right) panels shows the median loginix™ as a function 
of transverse (LOS) separation for the velocity (impact parameter) range indicated in the panels. These correspond to the central panel of Fig. [6] The different 
colored curves and the right i/-axis show the Spearman rank correlation coefficient between the errors of different points, computed by correlating 1 ,000 different 
bootstrap samples (see text). Data points are uncorrelated in the transverse direction. Along the LOS they are strongly correlated for separations < 100 km s — 1 
but become uncorrelated for distances ^> 10 2 km s — 1 , 

We use rank correlation tests instead of the covariance matrix because some of the pixels are set to very high optical depths due 
to saturation, or to very low optical depths because they have an apparent positive flux (see Section[3]l. By using a non-parametric 
measure of the correlation strength, we are not susceptible to such problems (we use medians rather than averages for the same 
reason). 

The red circles in the left panel of Figure [18] shows the median optical depth for LOS separations 80 — 114 kms -1 (i.e. 0.36 
- 0.50 pMpc for pure Hubble flow) as a function of transverse distance. Similarly, the black squares in the right panel show the 
median optical depth for transverse distances 0.36 - 0.50 pMpc as a function of LOS separation. These points are thus identical 
to those shown in the central panel of Figure [6] 

The rest of the colored curves in Figure [18] show the rank correlation coefficient between different points, with the correspond- 
ing scale indicated on the right y-axis. Values of the correlation coefficient from 0.5 to 1 imply a strong correlation, between 
0.2 and 0.5 a weak correlation, values around imply there is no correlation, and negative values imply that there is an anti- 
correlation. 

For example, in the left panel the top red curve shows the correlation between the 4th point and all the ones that lie at larger 
distances. The rank correlation coefficient for the 4th point with itself is 1, but the correlation with the subsequent points is about 
zero. This confirms that the points are uncorrelated in the transverse direction. 
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Figure 19. Similar to Figure [T8l but showing the medi an optical depth and the rank correlation coefficient as a function of 3-D Hubble distance. The correlations 
are weaker than along the LOS (right panel of Fig. 1181 and are only substantial for adjacent bins. 

On the other hand, the right panel shows that the points are strongly correlated for LOS separations < 100 km s _1 and become 
uncorrected for distances 10 2 kins" 1 . This is consistent with the fact that we observe no evidence for structure along the 
LOS for separations < 10 2 kins -1 (see Figs. [4] and |6). 

Finally, Figure [19] shows the extent of the correlation between the errors of different points when we bin in terms of the 3-D 
Hubble distance. As the 3-D Hubble distance combines uncorrelated transverse with correlated LOS separations, the result must 
be intermediate between that for the two directions. Figure [19] demonstrates that this is indeed the case. Adjacent points are 
weakly correlated, while more distant points are essentially uncorrelated. 
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